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Caterina M.J. et al have recently determined the molecular basis underlying this 
phenomenon by characterizing a functional cDNA that encodes a vanilloid receptor 
(VR-1) in rat sensory ganglia (Caterina M. J. et al, (1997) Nature 389:816-824). VR-1 
is a vanilloid-gated ? nonselective cation channel that resembles members of the transient 
5 receptor potential (TRP) channel family, first identified as components of the 

Drosophila phototransduction pathway (Montell et al (1989) Neuron 2:1313-1323). 
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Summary of the Invention 

The present invention is based, at least in part, on the discovery of novel 
members of the Capsaicin/Vanilloid family of receptors. Described herein is the 

1 0 isolation of the human orthologue of rat VR- 1 (rVR- 1 ), referred to herein as hVR-1 , as 
well as another previously unknown member of the VR family of receptors, referred 
herein as VR-2, and specifically as human VR-2 (hVR-2, including an alternate form 
which contains a deletion) and rat VR-2 (rVR-2) nucleic acid and protein molecules. 
The hVR-1, hVR-2, and rVR-2 molecules of the present invention are useful as targets 

15 for developing modulating agents to regulate a variety of cellular processes, e.g., cellular 
processes involved in pain. Accordingly, in one aspect, this invention provides isolated 
nucleic acid molecules encoding hVR-1, hVR-2, and rVR-2 proteins and fragments 
thereof, as well as nucleic acid fragments suitable as primers or hybridization probes for 
the detection of hVR-1, hVR-2, and rVR-2-encoding nucleic acids. 

20 In one embodiment, an hVR-1, hVR-2, or rVR-2 nucleic acid molecule of the 

invention is at least 60%, 65%, 70%, 75%, 80%, 83%, 85%, 86%, 87%, 88%, 89%, 
90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more identical to the 
nucleotide sequence (e.g., to the entire length of the nucleotide sequence) shown in SEQ 
ID NO:l, 3, 4, 6, 7, 9, 10, or 12 or a complement thereof. 

25 In another embodiment, the isolated nucleic acid molecule includes the 

nucleotide sequence shown SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, or a complement 
thereof. In another embodiment, the nucleic acid molecule includes at least 10, 15, 20, 
or more contiguous nucleotides of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12. 
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In another embodiment, an hVR-1 , hVR-2, and rVR-2 nucleic acid molecule 
includes a nucleotide sequence encoding a protein having an amino acid sequence 
sufficiently homologous to the amino acid sequence of SEQ ID NO:2, 5, 8, or 1 1. In 
one embodiment, an hVR-1, hVR-2, and rVR-2 nucleic acid molecule includes a 
5 nucleotide sequence encoding a protein having an amino acid sequence at least 60%, 
65%, 70%, 75%, 80%, 85%, 87%, 90%, 95%, 98% or more identical to the entire length 
of the amino acid sequence of SEQ ID NO:2, 5, 8, or 1 1. 

Another embodiment of the invention features nucleic acid molecules, preferably 
hVR-1, hVR-2, and rVR-2 nucleic acid molecules, which specifically detect hVR-1, 

1 0 hVR-2, and rVR-2 nucleic acid molecules relative to nucleic acid molecules encoding 
non-hVR-1, non-hVR-2, and non-hVR-2 proteins. For example, in one embodiment, 
such a nucleic acid molecule is at least 100-150, 1 150-200, 200-250, 250-300, 300-350, 
350-400, 400-450, 450-500, 500-550, 550-600, 600-700, 700-800, 800-900, 900-1000, 
1088, or more nucleotides in length and hybridizes under stringent conditions to a 

1 5 nucleic acid molecule comprising the nucleotide sequence shown in SEQ ID NO: 1,3,4, 
6, 7, 9, 10, or 12. In preferred embodiments, the nucleic acid molecules are at least 15 
(e.g., contiguous) nucleotides in length and hybridize under stringent conditions to 
nucleotides 1-17, 3696-3863, or 3901-3909 of SEQ ID NO: 1. In other preferred 
embodiments, the nucleic acid molecules comprise nucleotides 1-17, 3696-3863, or 

20 390 1 -3909 of SEQ ID NO: 1 . In yet other preferred embodiments, the nucleic acid 
molecules consist of nucleotides 1-17, 3696-3863, or 3901-3909 of SEQ ID NO:l. In 
preferred embodiments, the nucleic acid molecules are at least 15 (e.g., contiguous) 
nucleotides in length and hybridize under stringent conditions to nucleotides 1944-2003 
of SEQ ID NO:4. In other preferred embodiments, the nucleic acid molecules comprise 

25 nucleotides 1944-2003 of SEQ ID NO:4. In yet other preferred embodiments, the 
nucleic acid molecules consist of nucleotides 1944-2003 of SEQ ID NO:4. 

In other embodiments, the nucleic acid molecule encodes a naturally occurring 
allelic variant of a polypeptide comprising the amino acid sequence of SEQ ID NO:2, 5, 
8, or 1 1, wherein the nucleic acid molecule hybridizes to a nucleic acid molecule 

30 consisting of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12 under stringent conditions and is 
encoded by the same locus as hVR-1 , hVR-2 or rVR-2. 
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an amino acid sequence at leas, about 60%, 65%, 70%, 75%, 80%. 85%. 87% 90% 
95%, 98% or more homologous to the amino acid sequence of SEQ ID N0 2 5 8 or 
1 1 - In another embodiment, the protein, preferably an hVR- 1 , hVR-2, or rVR-2 protein 

> ,e -'° ne ^yrinre P e^^ 

pan, In yet another embodiment, the protein, preferably an hVR-1, hVR-2, and rVR-2 
protein, includes at .east one transmembrane domain, at least one proline rich domain 
and at least one anJcyrin repeat domain and is encoded by a nucleic acid molecule having 
a nucleotide sequence which hybridizes under stringent hybridization conditions to a 
10 nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO- 1 3 4 6 7 9 
10,orl2. ' ' J '*'°' 7 ' 9 > 

In another embodiment, the invention features fragments of the protein having 
the ammo acid sequence of SEQ ID NO:2, 5, 8, or 1 1, wherein the fragment comprises 
atI -' 5 .30,40,50,60^ 
15 In another embodiment, the invention features an isolated protein, preferably an 

hVR- 1 , hVR-2, and rVR-2 protein, which is encoded by a nucleic acid molecule 
coring of a nucleotide sequence at least about 60%, 65%, 70%, 75%, 80% 83% 
85o/ 0 , 860/0, 87o/o, 8 8o/o, 89 o/o, 90%, 91%, 92 o/o, 93o/ 0> 94% , 95 o /o , 96 o /o> ^ ^ ^ ^ 
more homologous to a nucleotide sequence of SEQ ID NO:l, 3, 4, 6, 7, 9 10 or 12 ora 
20 complements T*t^f^^ m ^^^^' m 
hVR-1, hVR-2, or rVR-2 protein, which is encoded by a nucleic acid molecule 
coasting of a nucleotide sequence which hybridizes under stringent hybridization 
condmons to a nucleic acid molecule consisting of the nucleotide sequence of SEQ ID 
NO:l, 3, 4, 6, 7, 9, 10, or 12, or a complement thereof. 
25 ™ epr0tei - of * e P^ 

persons thereof, can be operatively linJced to a non-hVR-,, no „. h VR-2, or non-rVR-2 
polypept.de (e.g., heterologous amino acid sequence) to form fusion proteins The 
■nvention further features antibodies, such as monoclonal or polyclonal antibodies, that 
specially bind proteins of the invention, preferably hVR-1, hVR-2, and rVR-2 
30 proteins. In addition, the hVR-l.hVR-2. and rVR-2 proteins or biologically active 
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portions thereof can be incorporated into pharmaceutical compositions, which optionally 
include pharmaceutical^ acceptable carriers. 

In another aspect the present invention provides a method for detecting the 
presence of an hVR-1, hVR-2, and rVR-2 nucleic acid molecule, protein or polypeptide 
in a biological sample by contacting the biological sample with an agent capable of 
detecting an hVR-1, hVR-2, and rVR-2 nucleic acid molecule, protein or polypeptide 
such that the presence of an hVR-1. hVR-2, and rVR-2 nucleic acid molecule, protein or 
polypeptide is detected in the biological sample. 

In another aspect, the present invention provides a method for detecting the 
presence of hVR-1, hVR-2, and rVR-2 activity in a biological sample by contacting the 
biological sample with an agent capable of detecting an indicator of hVR-1, hVR-2, and 
rVR-2 activity such that the presence of hVR-1, hVR-2, and rVR-2 activity is detected 

in the biological sample. 

In another aspect, the invention provides a method for modulating hVR-1 , hVR- 
2, and rVR-2 activity comprising contacting a cell capable of expressing hVR-1, hVR-2, 
and rVR-2 with an agent that modulates hVR-1. hVR-2, and rVR-2 activity such that 
hVR-1, hVR-2, and rVR-2 activity in the cell is modulated. In one embodiment, the 
agent inhibits hVR-1, hVR-2, and rVR-2 activity. In another embodiment, the agent 
stimulates hVR-1, hVR-2, and rVR-2 activity. In one embodiment, the agent is an 
antibody that specifically binds to an hVR-1, hVR-2, and rVR-2 protein. In another 
embodiment, the agent modulates expression of hVR-1, hVR-2, and rVR-2 by 
modulating transcription of an hVR-1, hVR-2, and rVR-2 gene or translation of an hVR- 
1, hVR-2, and rVR-2 mRNA. In yet another embodiment, the agent is a nucleic acid 
molecule having a nucleotide sequence that is antisense to the coding strand of an hVR- 
1, hVR-2, and rVR-2 mRNA or an hVR-1, hVR-2, and rVR-2 gene. 

In one embodiment, the methods of the present invention are used to treat a 
subject having a disorder characterized by aberrant hVR- 1 , hVR-2, and rVR-2 protein or 
nucleic acid expression or activity by administering an agent which is an hVR-1 , hVR-2, 
and rVR-2 modulator to the subject. In one embodiment, the hVR- 1 , hVR-2, and rVR-2 
modulator is an hVR-1, hVR-2, and rVR-2 protein. In another embodiment the hVR-1, 
hVR-2, and rVR-2 modulator is an hVR-1, hVR-2, and rVR-2 nucleic acid molecule. In 
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yet another embodiment, the hVR-1, hVR-2, and rVR-2 modulator is a peptide 
peptidomimetic, or other small molecule. In a further embodiment, the disorder 
characterized by aberrant hVR-,, hVR-2, and rVR-2 protein or nucleic acid expression 
is a pain disorder, e.g., hyperalgesia. 
5 The present invention also provides a diagnostic assay for identifying the 

presence or absence of a genetic alteration characterized by at least one of (i) aberrant 
modification or mutation of a gene encoding an hVR- 1 , hV R. 2 , and rVR-2 protein- (ii) 
--regulation of the gene; and (iii) aberrant post-^slational modification of an hVR- 
1 . hVR-2, and rVR-2 protein, wherein a wild-type form of the gene encodes a protein 
10 with an hVR-1, hVR-2, and rVR-2 activity (as described herein). 

^^eraspectmeinventionprovidesamemodforidentifyingacompound 
that bmds to or modulates the activity of an hVR-1. hVR-2, and rVR-2 protein by 

proving an indicator composition comprising an hVR-1, hVR-2, and rVR-2 protein 
ha ^ hV K-^VR-2,andr^^ 

15 ^ C o m pound.anddeterminin g theeffectofthetestcompoundonhVR.l, hV R-o ^ 
rVR-2 activity in the indicator composition to identify a compound that modulated 
activity of an hVR-1 , hVR-2, and rVR-2 protein. 

Other features and advantages of the invention will be apparent from the 
following detailed description and claims. 

!0 

Brief Description of the Drawing s 

fl** I <kpi«s .he ft,,! leagu, cDNA sequence and p.difled amino acid 
seouenceofhuman VR., (hVR.,). Vm^M.^^^ 
^Itoa^ofSEQrDNO:,. ^^^^^^ 

of the human VR-1 (hVR-I) gene is show, in SEQ m N0 .j 

Ffrr, 2 depio the ftl, tag* CDNA sequence and predicted amino acid 
sequence of human VR-2 (»VR. 2) . The nucleotide ^ ^ 

..o^ofSEOWO* The coding ^Wthom tfc y and , un« „g iom 
of the human VR-2 (hVR-2) gene is shown in SEQ ID NO:6. 
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Figure 3 depicts the partial cDNA sequence and partial predicted amino acid 
sequence of an alternate form of human VR-2 (hVR-2). The nucleotide sequence 
corresponds to nucleic acids 1 to 1489 of SEQ ID NO:7. The amino acid sequence 
corresponds to amino acids 1 to 436 of SEQ ID NO:8. The coding region without the 5' 
5 and 3' untranslated regions of the alternate form of human VR-2 (hVR-2) gene is shown 
in SEQ ID NO:9. 

Figure 4 depicts the partial cDNA sequence and partial predicted amino acid 
sequence of rat VR-2 (rVR-2). The nucleotide sequence corresponds to nucleic acids 1 
to 1794 of SEQ ID NO: 10. The amino acid sequence corresponds to amino acids 1 to 

10 554 of SEQ ID NO: 1 1 . The coding region without the 5' and 3' untranslated regions of 
the rat VR-2 (rVR-2) gene is shown in SEQ ID NO: 12. 

Figure 5 depicts an alignment of the hVR-1 protein (SEQ ID NO:2) with the 
human VR-2 protein (SEQ ID NO:5) using the GAP program in the GCG software 
package (Blosum 62 matrix) and a gap weight of 12 and a length weight of 4. 

1 5 Figure 6 depicts an alignment of the hVR- 1 nucleotide sequence (SEQ ID NO: 1 ) 

with the human VR-2 nucleotide sequence (SEQ ID NO:4) using the GAP program in 
the GCG software package (nwsgapdna matrix) and a gap weight of 50 and a length 
weight of 3. 

Figure 7 depicts an alignment of the hVR-2 protein (SEQ ID NO:5) with the rat 
20 VR-2 protein (SEQ ID NO: 1 1 ) using the CLUSTAL W ( 1 .74) multiple sequence 
alignment program. 

Figure 8 depicts an alignment of the hVR-2 protein (SEQ ID NO:5) with the rat 
VR-2 protein (SEQ ID NO. l 1) using the GAP program in the GCG software package 
(Blosum 62 matrix) and a gap weight of 12 and a length weight of 4. 
25 Figure 9 depicts an alignment of the hVR-1 nucleotide sequence (SEQ ID NO: 1 ) 

with the rat VR-1 nucleotide sequence (Accession Number: AF0293 10) using the GAP 
program in the GCG software package (nwsgapdna matrix) and a gap weight of 50 and a 
length weight of 3. 

Figure 10 depicts an alignment of the hVR-1 protein (SEQ ID NO:2) with the rat 
30 VR-1 protein (Accession Number:AF0293 1 0) using the GAP program in the GCG 
software package (Blosum 62 matrix) and a gap weight of 12 and a length weight of 4. 
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Figure J J depicts an alignment of the hVR-2 protein (SEQ ID NO-5) with ,h 
human VR-2 protein (alternate form) (SEQ ID NOJtt ■ „ 

hVR-l protein against the HMM database. 

hv^:.^ 

hVR-2 protein against the HMM database. 

VR-2 protein (alternate form) (SEQ ID NO:20). 

Figure /depicts an alignment of the hVR-2 protein (SEQ ID N05) with th 

CLUSTAL W (, .74) multiple sequence alignment program. 

Detailed Descriprinn «f the i nven#in „ 

The present invention is based at least in n art .u _,• 
?n ^ aacu ' aueastln P^, on the discovery of nucleic acid 

25 and rVR-2 molecules were identified t • K1 ' h VR " 2 > 

<VK I). VR-1 tsavanrltad gated, non-selective cation channel 
resemWes members of the transient receptor potemia, am io T f 

— ^ »»^an,ysishas Wi ca,ed m a,ra,V R ., CO n tti n S!ix 
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transmembrane domains (predicted to be mostly a-helices) with an additional short 
hydrophobic stretch between transmembrane regions 5 and 6. The amino terminal 
hydrophilic segment contains a relatively proline rich region followed by three ankyrin 
repeat domains. The rat VR-1 is expressed in small diameter neurons within sensory 
ganglia. The present hVR-1 sequence is the human orthologue of rVR-1. As described 
in further detail infra, the human VR-1 is expressed in nodose, trigeminal sensory 
neurons, as well as in some, but not all, small dorsal root ganglion (DRG) neurons and 
in a few medium sized DRG neurons. 

The hVR-1, hVR-2, and rVR-2 molecules of the present invention play a role in 
pain signaling mechanisms. As used herein, the term "pain signaling mechanisms" 
includes the cellular mechanisms involved in the development and regulation of pain, 
e.g., pain elicited by noxious chemical, mechanical, or thermal stimuli, in a subject, e.g., 
a mammal such as a human. In mammals, the initial detection of noxious chemical, 
mechanical, or thermal stimuli, a process referred to as "nociception", occurs 
predominantly at the peripheral terminals of specialized, small diameter primary afferent 
neurons, called polymodal nociceptors. These afferent neurons transmit the information 
to the central nervous system, evoking a perception of pain or discomfort and initiating 
appropriate protective reflexes. Capsaicin/Vanilloid receptors, e.g., the hVR-1, hVR-2, 
and rVR-2 molecules of the present invention, present on these afferent neurons, are 
involved in detecting these noxious chemical, mechanical, or thermal stimuli and 
transducing this information into membrane depolarization events. Thus, the hVR-1, 
hVR-2, and rVR-2 molecules by participating in pain signaling mechanisms, can 
modulate pain elicitation and provide novel diagnostic targets and therapeutic agents to 
control pain. 

The hVR-1, hVR-2, and rVR-2 molecules provide novel diagnostic targets and 
therapeutic agents to control pain in a variety of disorders, diseases, or conditions which 
are characterized by a deregulated, e.g., upregulated or downregulated, pain response. 
For example, the hVR-1, hVR-2, and rVR-2 molecules provide novel diagnostic targets 
and therapeutic agents to control the exaggerated pain response elicited during various 
forms of tissue injury, e.g., inflammation, infection, and ischemia, usually referred to as 
hyperalgesia (described in, for example, Fields, H.L. (1987) Pain, New York:McGraw- 
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Hill). Moreover, the hVR-1, hVR-2, and rVR-2 molecules provide novel diagnostic 
targets and therapeutic agents to control pain associated with muscoloskeletal disorders, 
e.g., joint pain; tooth pain; headaches; pain associated with surgery, or neuropathic pain. 
As the hVR-1 gene maps to a region of human chromosome 17 between WI- 

5 5436 (7.7cR) and WI-6584 (1 8.9cR) (Example 6), which has been associated with 
myasthenia gravis, Smith-Magenis syndrome. CORDS, Cone-rod dysrtophy, and breast 
cancer, the hVR-1 molecule may provide novel diagnostic targets and therapeutic agents 
to treat, diagnose, or prognose these disorders or other disorders linked to this 
chromosomal region. Similarly, as the hVR-2 gene maps to a region of human 

10 chromosome 1 7 between AFMA043ZB5 (23.3 cR) and D17S721 (29.3cR) (Example 6) 
which has been associated with myasthenia gravis, Smith-Magenis syndrome, CORDS, 
Cone-rod dysrtophy, choroidal dystrophy, central areolar, and retinal cone dystrophy, 
the hVR-2 molecule may provide novel diagnostic targets and therapeutic agents to treat, 
diagnose, or prognose these disorders or other disorders linked to this chromosomal 

15 region. 

The term "family" when referring to the protein and nucleic acid molecules of 
the invention is intended to mean two or more proteins or nucleic acid molecules having 
a common structural domain or motif and having sufficient amino acid or nucleotide 
sequence homology as defined herein. Such family members can be naturally or non- 
20 naturally occurring and can be from either the same or different species. For example, a 
family can contain a first protein of human origin, as well as other, distinct proteins of 
human origin or alternatively, can contain homologues of non-human origin. Members 
of a family may also have common functional characteristics. 

For example, the family of hVR-1, hVR-2, and rVR-2 proteins comprise at least 
25 one, and preferably six "transmembrane domains." As used herein, the term 

"transmembrane domain" includes an amino acid sequence of about 15 amino acid 
residues in length which spans the plasma membrane. More preferably, a 
transmembrane domain includes about at least 20, 25, 30, 35, 40, or 45 amino acid 
residues and spans the plasma membrane. Transmembrane domains are rich in 
30 hydrophobic residues, and typically have a helical structure. In a embodiment, at least 
50%, 60%, 70%, 80%, 90%, 95% or more of the amino acid residues of a 
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example, Ketchum K.A et al (1996) FEBS Letters 378:19-26, the contents of which are 
incorporated herein by reference. 

To identify the presence of an ankyrin repeat domain in an hVR-1, hVR-2, and 
rVR-2 protein and make the determination that a protein of interest has a particular 

5 profile, the amino acid sequence of the protein is searched against a database of HMMs 
(e.g., the Pfam database, release 2.1) using the default parameters 
(http://www.sanger.ac.uk/Software^ A description of the Pfam 

database can be found in Sonhammer et al (1997) Proteins 28(3)405-420 and a detailed 
description of HMMs can be found, for example, in Gribskov et al(\99Q) Meth 

10 Enzymol 183:146-159; Gribskov et a/.(1987) Proc. Natl Acad. Scl USA 84:4355- 
4358; Kroghe/<z/.(l 994)7. Mol Biol 235:1501-1531; and Stultze/a/.(1993) Protein 
Sci 2:305-3 1 4, the contents of which are incorporated herein by reference. A search 
was performed against the HMM database resulting in the identification of three ankyrin 
repeat domains in the amino acid sequence of SEQ ID NO:2 (at about residues 201-233, 

15 248-283, and 333-361 ) and SEQ ID NO:5 (at about residues 1 62-194, 208-243, and 293- 
328). The results of the searches are set forth in Figures 13 and 1 5. 

Isolated proteins of the present invention, preferably hVR-1, hVR-2, and rVR-2 
proteins, have an amino acid sequence sufficiently identical to the amino acid sequence of 
SEQ ID NO:2, 5, 8, or 1 1 or are encoded by a nucleotide sequence sufficiently identical to 

20 SEQ ID NO: 1,3,4, 6, 7, 9, 10, or 12. As used herein, the term "sufficiently identical" 
refers to a first amino acid or nucleotide sequence which contains a sufficient or minimum 
number of identical or equivalent (e.g., an amino acid residue which has a similar side 
chain) amino acid residues or nucleotides to a second amino acid or nucleotide sequence 
such that the first and second amino acid or nucleotide sequences share common structural 

25 domains or motifs and/or a common functional activity. For example, amino acid or 

nucleotide sequences which share common structural domains have at least 30%, 40%, or 
50% identity, preferably 60% identity, more preferably 70%-80%, and even more 
preferably 90-95% identity across the amino acid sequences of the domains and contain at 
least one and preferably two structural domains or motifs, are defined herein as sufficiently 

30 identical. Furthermore, amino acid or nucleotide sequences which share at least 30%, 40%, 
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or 50%, preferably 60%, more preferably 70-80%, or 90-95% identity and share a common 
functional activity are defined herein as sufficiently identical. 

As used interchangeably herein, an "hVR-1, hVR-2, and rVR-2 activity", "biological 
activity of hVR-1, hVR-2, and rVR-2" or "functional activity of hVR-1, hVR-2, and rVR-2", 
5 refers to an activity exerted by an hVR-1 , hVR-2, and rVR-2 protein, polypeptide or nucleic 
acid molecule on an hVR-1 , hVR-2, and rVR-2 responsive cell or on an hVR-1, hVR-2, and 
rVR-2 protein substrate, as determined in vivo, or in vitro, according to standard techniques. 
In one embodiment, an hVR-1 , hVR-2, and rVR-2 activity is a direct activity, such as an 
association with an hVR-1, hVR-2, and rVR-2-target molecule. As used herein, a "target 

1 0 molecule'* or "binding partner" is a molecule with which an hVR-1 , hVR-2, and rVR-2 

protein binds or interacts in nature, such that hVR-1, hVR-2, and rVR-2-mediated function is 
achieved. An hVR-1, hVR-2, and rVR-2 target molecule can be a non-hVR-1, non-hVR-2, 
and non-rVR-2 molecule or an hVR-1, hVR-2, and rVR-2 protein or polypeptide of the 
present invention. In an exemplary embodiment, an hVR-1 , hVR-2, and rVR-2 target 

15 molecule is an hVR-1, hVR-2, and rVR-2 ligand, e.g., capsaicin. Alternatively, an hVR-1, 
hVR-2, and rVR-2 activity is an indirect activity, such as a cellular signaling activity 
mediated by interaction of the hVR-1, hVR-2, and rVR-2 protein with an hVR-1, hVR-2, and 
rVR-2 ligand. 

Accordingly, another embodiment of the invention features isolated hVR-1, 
20 hVR-2. and rVR-2 proteins and polypeptides having an hVR-1 , hVR-2, and rVR-2 
activity. Other proteins of the invention are hVR-1, hVR-2, and rVR-2 proteins having 
at least one, and preferably six, transmembrane domains and, preferably, an hVR-1, 
hVR-2, and rVR-2 activity. Yet other proteins of the invention are hVR-1, hVR-2, and 
rVR-2 proteins having at least one transmembrane domain, at least one proline rich 
25 domain and, preferably, an hVR-1, hVR-2, and rVR-2 activity. Other proteins of the 
invention are hVR-1, hVR-2, and rVR-2 proteins having at least one transmembrane 
domain, at least one proline rich domain, at least one ankyrin repeat domain and, 
preferably, an hVR-1, hVR-2, and rVR-2 activity. Additional proteins of the invention 
have at least one transmembrane domain, at least one proline rich domain, at least one 
30 ankyrin repeat domain, and are, preferably, encoded by a nucleic acid molecule having a 
nucleotide sequence which hybridizes under stringent hybridization conditions to a 
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nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 
10, or 12. 

The nucleotide sequence of the full length hVR-1 cDNA and the predicted 
amino acid sequence of the hVR-1 polypeptide are shown in Figure 1 and in SEQ ID 
5 NOs: 1 and 2, respectively. 

The nucleotide sequence of the full length hVR-2 cDNA and the predicted amino 
acid sequence of the hVR-2 polypeptide are shown in Figure 2 and in SEQ ID NOs:4 
and 5, respectively. 

The nucleotide sequence of the partial hVR-2 (alternate form) cDNA and the 
10 predicted amino acid sequence of the hVR-2 (alternate form) polypeptide are shown in 
Figure 3 and in SEQ ID NOs:7 and 8, respectively. 

The nucleotide sequence of the partial rVR-2 cDNA and the predicted amino 
acid sequence of the rVR-2 polypeptide are shown in Figure 4 and in SEQ ID NOs: 10 
and 1 1 , respectively. 

] 5 Various aspects of the invention are described in further detail in the following 

subsections: 

I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated nucleic acid molecules that 

20 encode hVR-1, hVR-2, and rVR-2 proteins or biologically active portions thereof, as 
well as nucleic acid fragments sufficient for use as hybridization probes to identify hVR- 
1, hVR-2, and rVR-2-encoding nucleic acid molecules (e.g., hVR-1, hVR-2, and rVR-2 
mRNA) and fragments for use as PCR primers for the amplification or mutation of hVR- 
1, hVR-2, and rVR-2 nucleic acid molecules. As used herein, the term "nucleic acid 

25 molecule" is intended to include DNA molecules (e.g., cDNA or genomic DNA) and 
RNA molecules (e.g., mRNA) and analogs of the DNA or RNA generated using 
nucleotide analogs. The nucleic acid molecule can be single-stranded or double- 
stranded, but preferably is double-stranded DNA. 

The term "isolated nucleic acid molecule" includes nucleic acid molecules which 

30 are separated from other nucleic acid molecules which are present in the natural source 
of the nucleic acid. For example, with regards to genomic DNA, the term "isolated" 
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includes nucleic acid molecules which are separated from the chromosome with which 
the genomic DNA is naturally associated. Preferably, an "isolated" nucleic acid is free 
of sequences which naturally flank the nucleic acid (i.e., sequences located at the 5' and 
3' ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic 
5 acid is derived. For example, in various embodiments, the isolated hVR-1, hVR-2, and 
rVR-2 nucleic acid molecule can contain less than about 5 kb, 4kb, 3kb, 2kb, 1 kb, 0.5 
kb or 0.1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in 
genomic DNA of the cell from which the nucleic acid is derived. Moreover, an 
"isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially free of 
10 other cellular material, or culture medium when produced by recombinant techniques, or 
substantially free of chemical precursors or other chemicals when chemically 
synthesized. 

A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule 
having the nucleotide sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12. Using all or 

15 portion of the nucleic acid sequence of SEQ ID NO: 1 , 3, 4, 6, 7, 9, 10, or 12, as a 

hybridization probe, hVR-1, hVR-2, and rVR-2 nucleic acid molecules can be isolated 
using standard hybridization and cloning techniques (e.g., as described in Sambrook, J., 
Fritsh, E. F., and Maniatis, T. Molecular Cloning: A Laboratory Manual. 2nd, ed, Cold 
Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press. Cold Spring Harbor, 

20 NY, 1989). 

Moreover, a nucleic acid molecule encompassing all or a portion of SEQ ID 
NO:l, 3, 4, 6, 7, 9, 10, or 12, can be isolated by the polymerase chain reaction (PCR) 
using synthetic oligonucleotide primers designed based upon the sequence of SEQ ID 
NO:l,3,4, 6, 7, 9,10, or 12. 

25 A nucleic acid of the invention can be amplified using cDNA, mRNA or 

alternatively, genomic DNA, as a template and appropriate oligonucleotide primers 
according to standard PCR amplification techniques. The nucleic acid so amplified can 
be cloned into an appropriate vector and characterized by DNA sequence analysis. 
Furthermore, oligonucleotides corresponding to hVR-1 , hVR-2, and rVR-2 nucleotide 

30 sequences can be prepared by standard synthetic techniques, e.g., using an automated 
DNA synthesizer. 



WO 00/29577 PCT/US99/2670I 

- 18- 

In one embodiment, an isolated nucleic acid molecule of the invention comprises 
the nucleotide sequence shown in SEQ ID NO: 1 . The sequence of SEQ ID NO: 1 
corresponds to the full length hVR-1 encoding cDNA. 

In another embodiment, an isolated nucleic acid molecule of the invention 
5 comprises the nucleotide sequence shown in SEQ ID NO:4. The sequence of SEQ ID 
NO:4 corresponds to the full length hVR-2 encoding cDNA. 

In another embodiment, an isolated nucleic acid molecule of the invention 
comprises the nucleotide sequence shown in SEQ ID NO:7. The sequence of SEQ ID 
NO:7 corresponds to a fragment of the hVR-2 (alternate form) encoding cDNA. 
10 In another embodiment, an isolated nucleic acid molecule of the invention 

comprises the nucleotide sequence shown in SEQ ID NO: 10. The sequence of SEQ ID 
NO: 10 corresponds to a fragment of the rVR-2 cDNA. 

In another embodiment, an isolated nucleic acid molecule of the invention 
comprises a nucleic acid molecule which is a complement of the nucleotide sequence 
15 shown in SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, or a portion of any of these nucleotide 
sequences. A nucleic acid molecule which is complementary to the nucleotide sequence 
shown in SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, is one which is sufficiently 
complementary to the nucleotide sequence shown in SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 
12, such that it can hybridize to the nucleotide sequence shown in SEQ ID NO:l, 3, 4, 6, 
20 7, 9, 1 0, or 12 thereby forming a stable duplex. 

In still another embodiment, an isolated nucleic acid molecule of the present 
invention comprises a nucleotide sequence which is at least about 60%, 65%, 70%, 75%, 
80%, 83%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 
98%, 99% or more homologous to the entire length of the nucleotide sequence shown in 
25 SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, or a portion of any of these nucleotide sequences. 

Moreover, the nucleic acid molecule of the invention can comprise only a portion 
of the nucleic acid sequence of SEQ ED NO:l, 3, 4, 6, 7, 9, 10, or 12, for example, a 
fragment which can be used as a probe or primer or a fragment encoding a portion of an 
hVR-1, hVR-2, and rVR-2 protein, e.g., a biologically active portion of an hVR-1, hVR- 
30 2, and rVR-2 protein. The nucleotide sequence determined from the cloning of the 
hVR-1, hVR-2, and rVR-2 gene allows for the generation of probes and primers 
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The invention further encompasses nucleic acid molecules that differ from the 
nucleotide sequence shown in SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, due to degeneracy 
of the genetic code and thus encode the same hVR-1, hVR-2, and rVR-2 proteins as 
those encoded by the nucleotide sequence shown in SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 
5 12. In another embodiment, an isolated nucleic acid molecule of the invention has a 
nucleotide sequence encoding a protein having an amino acid sequence shown in SEQ 
ID NO:2, 5, 8, or 11. 

In addition to the hVR-1, hVR-2, and rVR-2 nucleotide sequences shown in SEQ 
ID NO:l, 3, 4, 6, 7, 9, 10, or 12, it will be appreciated by those skilled in the art that 

10 DNA sequence polymorphisms that lead to changes in the amino acid sequences of the 
hVR-1, hVR-2, and rVR-2 proteins may exist within a population (e.g., the human 
population). Such genetic polymorphism in the hVR-1, hVR-2, and rVR-2 genes may 
exist among individuals within a population due to natural allelic variation. As used 
herein, the terms "gene" and "recombinant gene" refer to nucleic acid molecules which 

15 include an open reading frame encoding an hVR-1, hVR-2, and rVR-2 protein. 

preferably a mammalian hVR-1, hVR-2, and rVR-2 protein, and can further include non- 
coding regulatory sequences, and introns. 

Allelic variants of hVR-1, hVR-2, and rVR-2 include both functional and non- 
functional hVR-1, hVR-2, and rVR-2 proteins. Functional allelic variants are naturally 

20 occurring amino acid sequence variants of the hVR-1, hVR-2, and rVR-2 protein that 
maintain the ability to bind an hVR-1, hVR-2, and rVR-2 ligand and/or modulate a pain 
signaling mechanism. Functional allelic variants will typically contain only 
conservative substitution of one or more amino acids of SEQ ID NO:2, 5, 8, or 1 1, or 
substitution, deletion or insertion of non-critical residues in non-critical regions of the 

25 protein. 

Non-functional allelic variants are naturally occurring amino acid sequence 
variants of the hVR-1, hVR-2, and rVR-2 protein that do not have the ability to either 
bind an hVR-1, hVR-2, and rVR-2 ligand and/or modulate a pain signaling mechanism. 
Non-functional allelic variants will typically contain a non-conservative substitution, a 
30 deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID 
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NO:2, 5, 8, or 1 L or a substitution., insertion or deletion in critical residues or critical 
regions. 

The present invention further provides non-human orthologues of the hVR-2 and 
rVR-2 protein. Orthologues of the hVR-2 and rVR-2 protein are proteins that are 
5 isolated from non-human and non-rat organisms and possess the same hVR-2 and rVR- 
2 ligand binding and/or modulation of pain signaling mechanism capabilities of the 
hVR-2 and rVR-2 proteins. Orthologues of the hVR-2 and rVR-2 proteins can readily 
be identified as comprising an amino acid sequence that is substantially homologous to 
SEQIDNO: 4, 6, 8 or 10. 

1 0 Moreover, nucleic acid molecules encoding other hVR- 1 , hVR-2, and rVR-2 

family members and, thus, which have a nucleotide sequence which differs from the 
hVR-1, hVR-2, and rVR-2 sequences of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, are 
intended to be within the scope of the invention. For example, another hVR-1, hVR-2, 
and rVR-2 cDNA can be identified based on the nucleotide sequence of hVR-1, hVR-2, 

1 5 and rVR-2. Moreover, nucleic acid molecules encoding VR-2 proteins from different 
species, and which, thus, have a nucleotide sequence which differs from the hVR-2 and 
rVR-2 sequences of SEQ ID NO:4, 6, 8, or 10 are intended to be within the scope of the 
invention. For example, a mouse hVR-2 cDNA can be identified based on the 
nucleotide sequence of the human VR-2 (hVR-2) or the rat VR-2 (rVR-2). 

20 Nucleic acid molecules corresponding to natural allelic variants and homologues 

of the hVR-I, hVR-2, and rVR-2 cDNAs of the invention can be isolated based on their 
homology to the hVR-1, hVR-2, and rVR-2 nucleic acids disclosed herein using the 
cDNAs disclosed herein, or a portion thereof, as a hybridization probe according to 
standard hybridization techniques under stringent hybridization conditions. Nucleic acid 

25 molecules corresponding to natural allelic variants and homologues of the hVR-1, hVR- 
2, and rVR-2 cDNAs of the invention can further be isolated by mapping to the same 
chromosome or locus as the hVR-1, hVR-2, and rVR-2 gene. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 15, 20, 25, 30 or more nucleotides in length and hybridizes under 

30 stringent conditions to the nucleic acid molecule comprising the nucleotide sequence of 
SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12. In other embodiment, the nucleic acid is at least 
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additionai amino acid residues that are conserved between the hVR-1, hVR-2. and rVR-2 
proteins of the present invention and other members of the Capsaicin/Vanilloid receptor 
family are not likely to be amenable to alteration. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
5 encoding hVR- 1 , hVR-2, and rVR-2 proteins that contain changes in amino acid residues 
that are not essential for activity. Such hVR-1, hVR-2, and rVR-2 proteins differ in 
amino acid sequence from SEQ ID NO:2. 5, 8, or 1 1, yet retain biological activity. In 
one embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence 
encoding a protein, wherein the protein comprises an amino acid sequence at least about 
10 60%, 65%, 70%, 75%, 80%, 85%, 87%, 90%, 95%, 98% or more homologous to SEQ 
ID NO:2,5,8, or 11. 

An isolated nucleic acid molecule encoding an hVR-1, hVR-2, and rVR-2 protein 
homologous to the protein of SEQ ID NO:2, 5, 8, or 1 1 can be created by introducing one 
or more nucleotide substitutions, additions or deletions into the nucleotide sequence of 
15 SEQ ID NO: I, 3, 4, 6, 7, 9, 10, or 12, such that one or more amino acid substitutions, 
additions or deletions are introduced into the encoded protein. Mutations can be 
introduced into SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, by standard techniques, such as 
site-directed mutagenesis and PCR-mediated mutagenesis. Preferably, conservative 
amino acid substitutions are made at one or more predicted non-essential amino acid 
20 residues. A "conservative amino acid substitution 11 is one in which the amino acid 

residue is replaced with an amino acid residue having a similar side chain. Families of 
amino acid residues having similar side chains have been defined in the art. These 
families include amino acids with basic side chains (e.g., lysine, arginine, histidine), 
acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., 
25 glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains 
(e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), 
beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains 
(e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, a predicted nonessential 
amino acid residue in an hVR-1, hVR-2, and rVR-2 protein is preferably replaced with 
30 another amino acid residue from the same side chain family. Alternatively, in another 
embodiment, mutations can be introduced randomly along all or part of an hVR-1, hVR- 
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*eabi.i V ,o(,) i „ 1 erac l w,,han„„. h VR. 1 ,„ 0 „. h VR. 2 ,or„„„. r v R . Jprolein „ 
e.g., a vanilloid compound such as capsaicin; (2) modular intracellular calcium 

concentrauon;(3)acn'vateanhVR.l,hVR.2 andrVR9A.„ a ■ . 

K rV R-2-deperalenl signal transduction 
pathway; or (4) modulalc a pain signaling mechanism. 

10 In addition ,o the nucleic acid molecules encoding hVR-, . hVR . 2 , md rVR ., 

proteins described above, another aspect of the invention penains to isolated nucleoid 
molecules which are antisense tea A „ ^ ^ ^ 

nucleotide sequence which is complement ,o a -sense- nucleic acid encoding a 
p ro .e,n, , ? , complement*^ „ e coding strand of adouble-amnded cDNA moleculeor 
15 complementary to an mRNA sequence. Accordingly, an antisense nucleic acid can 

hydrogen bond » a sense nucleic acid. Tie antisense nucleic acid can be complementary 
to an enure hVR-,, hVR-2, and rVR-2 coding strand, or to only a portion thereof ,„ Z 
embodrmem, an antisense nucleic acid molecule is antisense to a -coding region" of the 
codmg strand of a nucleotide sequence erc^ding hVR., , hVR-2, and rVR-2 The term 
•coding region" refers to the region of the nucleotide scouence comprising codons which 
are translated into amino acid residues (e. S ., the coding region of hVR-1, hVR-2 and 
-VR-2). In another embodiment, the antis^se nucleic acid molecule is antisense to a 
"noncoding region" of the coding sttand of a nucleotide sequence encoding hVR-l hVR. 
2, and rVR-2. The term "noncoding region- refe* ,„ 5. a*, y sequences which flank the 
25 codmg region tha, are no, translated into amino acids (,.,, also referred ,„ as y and 3' 
untranslated regions). 

Given the coding strand sequences encoding hVR.,,hVR-2, and rVR-2 disclosed 

Watson and Crick base pairing. The antisense nucleic acid molecule can be 
30 complementary to the entire coding region of hVR-1, hVR-2, and rVR-2 mRNA but 
more preferably is „ oligonucleotide which is antisense to only a portion of the coding 



20 
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or noncoding region of hVR-1, hVR-2, and rVR-2 mRNA. For example, the antisense 
oligonucleotide can be complementary to the region surrounding the translation start site 
of hVR-1, hVR-2, and rVR-2 mRNA. An antisense oligonucleotide can be, for example, 
about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic 
5 acid of the invention can be constructed using chemical synthesis and enzymatic ligation 
reactions using procedures known in the art. For example, an antisense nucleic acid (e.g., 
an antisense oligonucleotide) can be chemically synthesized using naturally occurring 
nucleotides or variously modified nucleotides designed to increase the biological stability 
of the molecules or to increase the physical stability of the duplex formed between the 
1 0 antisense and sense nucleic acids, e.g. , phosphorothioate derivatives and acridine 
substituted nucleotides can be used. Examples of modified nucleotides which can be 
used to generate the antisense nucleic acid include 5-fluorouracil, 5-bromouraciI, 5- 
chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4-acetylcytosine, 5- 
(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5- 
1 5 carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, I -methylguanine, 1 -methylinosine, 2,2-dimethylguanine, 2- 
methyladenine, 2-methyIguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7- 
methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta- 
D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 
20 N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 

queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, 
uraciI-5- oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3- 
(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, 
the antisense nucleic acid can be produced biologically using an expression vector into 
which a nucleic acid has been subcloned in an antisense orientation (i.e., RNA 
transcribed from the inserted nucleic acid will be of an antisense orientation to a target 
nucleic acid of interest, described fiirther in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered 
to a subject or generated in situ such that they hybridize with or bind to cellular mRNA 
and/or genomic DNA encoding an hVR-1, hVR-2, and rVR-2 protein to thereby inhibit 
expression of the protein, e.g., by inhibiting transcription and/or translation. The 
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hybridization can be by conventional nucleotide complementarity to form a stable 
duplex, or, for example, in the case of an antisense nucleic acid molecule which binds to 
DNA duplexes, through specific interactions in the major groove of the double helix. 
An example of a route of administration of antisense nucleic acid molecules of the 
5 invention include direct injection at a tissue site. Alternatively, antisense nucleic acid 
molecules can be modified to target selected cells and then administered systemically. 
For example, for systemic administration, antisense molecules can be modified such that 
they specifically bind to receptors or antigens expressed on a selected cell surface, e.g., 
by linking the antisense nucleic acid molecules to peptides or antibodies which bind to 
10 cell surface receptors or antigens. The antisense nucleic acid molecules can also be 
delivered to cells using the vectors described herein. To achieve sufficient intracellular 
concentrations of the antisense molecules, vector constructs in which the antisense 
nucleic acid molecule is placed under the control of a strong pol II or pol III promoter 
are preferred. 

1 5 In yet another embodiment, the antisense nucleic acid molecule of the invention 

is an -anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms 
specific double-stranded hybrids with complementary RNA in which, contrary to the 
usual -units, the strands run parallel to each other (Gaultier et al (1987) Nucleic Acids. 
Res. 15:6625-6641). The antisense nucleic acid molecule can also comprise a 2'-o- 
methylribonucleotide (Inoue et al. (1987) Nucleic Acids Res. 15:6131-6148) or a 
chimeric RNA-DNA analogue (Inoue et al (1987) FEBSLett. 215:327-330). 

In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity which are 
capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they 
have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes 
(described in Haselhoffand Gerlach (1988) Nature 334:585-591)) can be used to 
catalytically cleave hVR-1, hVR-2, and rVR-2 mRNA transcripts to thereby inhibit 
translation of hVR-1, hVR-2, and rVR-2 mRNA. A ribozyme having specificity for an 
hVR-1 , hVR-2, and rVR-2-encoding nucleic acid can be designed based upon the 
nucleotide sequence of an hVR-1, hVR-2, and rVR-2 cDNA disclosed herein (/.e, SEQ 
ID NO:l, 3, 4, 6, 7, 9, 10, or 12). For example, a derivative of a Tetrahymena L-19 IVS 



WO 00/29577 PCT/US99/26701 

-27- 

RNA can be constructed in which the nucleotide sequence of the active site is 
complementary to the nucleotide sequence to be cleaved in an hVR-1, hVR-2, and rVR- 
2-encoding mRNA. See, e.g., Cech et al. U.S. Patent No. 4,987,071; and Cech et al. 
U.S. Patent No. 5,1 1 6,742. Alternatively, hVR-1, hVR-2, and rVR-2 mRNA can be 
5 used to select a catalytic RNA having a specific ribonuclease activity from a pool of 
RNA molecules. See, for example, Bartel, D. and Szostak, J.W. (1993) Science 
261:1411-1418. 

Alternatively, hVR-1, hVR-2, and rVR-2 gene expression can be inhibited by 
targeting nucleotide sequences complementary to the regulatory region of the hVR-1, 

1 0 h VR-2, and rVR-2 {e.g. , the hVR- 1 , hVR-2, and rVR-2 promoter and/or enhancers) to 
form triple helical structures that prevent transcription of the hVR-1, hVR-2, and rVR-2 
gene in target cells. See generally, Helene, C. (1991 ) Anticancer Drug Des. 6(6):569- 
84; Helene, C. et al. (1992) Ann. N. Y. Acad Sci. 660:27-36; and Maher, LJ. (1992) 
Bioassays 14(12):807-15. 

1 5 In yet another embodiment, the hVR- 1 , h VR-2, and rVR-2 nucleic acid 

molecules of the present invention can be modified at the base moiety, sugar moiety or 
phosphate backbone to improve, e.g., the stability, hybridization, or solubility of the 
molecule. For example, the deoxyribose phosphate backbone of the nucleic acid 
molecules can be modified to generate peptide nucleic acids (see Hyrup B, et al. (1996) 

20 Bioorganic & Medicinal Chemistry 4(1): 5-23). As used herein, the terms "peptide 
nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., DNA mimics, in which the 
deoxyribose phosphate backbone is replaced by a pseudopeptide backbone and only the 
four natural nucleobases are retained. The neutral backbone of PNAs has been shown to 
allow for specific hybridization to DNA and RNA under conditions of low ionic 

25 strength. The synthesis of PNA oligomers can be performed using standard solid phase 
peptide synthesis protocols as described in Hyrup B. et al (1996) supra; Perry-O'Keefe 
et al. Proc. Natl. Acad. Sci. 93: 14670-675. 

PNAs of hVR-1, hVR-2, and rVR-2 nucleic acid molecules can be used in 
therapeutic and diagnostic applications. For example, PNAs can be used as antisense or 

30 antigene agents for sequence-specific modulation of gene expression by, for example, 
inducing transcription or translation arrest or inhibiting replication. PNAs of hVR-1 , 
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hVR-2, and ,VR-2 nU c,eic acid raolKules ca „ a|so be ujed jn ^ ^ 

patr mutattons in a gene, by PNA-direcWPCR ^ ^ 

en^mes - * combination with Cher e.ymes, SI nuc|eases 
(. 99« or as probes or primro for DNA or hybridi2aUM 

5 ""'(lW^raiPeiry-OXeefejapro). ' 
In another embodiment, PNAs of hVR-, . hVR-2, a™, ,VR-2 can be modified 

groups .0 PNA. by the formation of PNA-DNA chimeras, or by the use of liposomes or 

,0 TJZT?*°' mmy>m ~**' mL F °«™A-DNA chimeras 
.0 of hVR.,, nVR-2, and rVR-2 nucleic acid mo.ecules can be generated which may 

comb,™ the advantageous propenies of PNA and DNA. Such chimeras allow DNA 
recognition enzymes, (e.*, RNAse H and DNA polymerases,, to interact with the DNA 
ponton wh„e the PNA potion would provide high binding affinity a» d 
PNA-DNA chimeras can he linked using linkers of appropriate lengths selected in terms 
5 of b** stackmg, number of honds between the nucleobases, a*, orientation (Hy^p B 

Hyrup B. (1996) supra and Finn P.J. « „/. (,996) Nuclelc Acids ges 2 

For example, a DNA chain can be synthesis on a solid support using ^ ' 

ptosphoramidite coupling chemistry and modified nucleoside analogs e g. 5' (4 

PNA and the 5 end of DNA (Mag, M.e/ a/, (1989) Nucleic Acid Res. ,7-5973-88). 

•* ' 5 Seement md a 3 ' DN * (Finn P.J. „„,.(, 996, supra) 

25 PNA segment^, K .„. e , o/ . (1975)^„ fc ^ c4m ^ J; „ 

'"^'"bodimen^fteoligonucbotidemayincludeomerappendedgro^s ' 
such as peptides (,*, for feting host cell receptors ,„ «*,), or ^ f 

30 PCT Publication No. W088/098 10^ or th P w^ u • , 

No wog9/inn4 , . ^/ 8I0)OrAebl °^- brainbar n^(^,^.,PCTPublication 
No. W089/10134). In add-on, oligonucleotides can be modified with hybridization- 
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triggered cleavage agents (See, e.g., Krol et ai (1988) Bio-Techniques 6:958-976) or 
intercalating agents. (See, e.g., Zon (1988) Pharm. Res. 5:539-549). To this end, the 
oligonucleotide may be conjugated to another molecule, (e.g., a peptide, hybridization 
triggered cross-linking agent, transport agent, or hybridization-triggered cleavage agent). 

5 

II. Isolated hVR-1. hVR-2, and rVR-2 Proteins and Anti-hVR-1, Anti-hVR-2. and Anti- 
rVR-2 Antibodies 

One aspect of the invention pertains to isolated hVR-1 , hVR-2, and rVR-2 
proteins, and biologically active portions thereof, as well as polypeptide fragments 

1 0 suitable for use as immunogens to raise anti-hVR-2, anti-hVR-2, and anti-rVR-2 
antibodies. In one embodiment, native hVR-1, hVR-2, and rVR-2 proteins can be 
isolated from ceils or tissue sources by an appropriate purification scheme using 
standard protein purification techniques. In another embodiment, hVR-1 , hVR-2, and 
rVR-2 proteins are produced by recombinant DNA techniques. Alternative to 

15 recombinant expression, an hVR-1, hVR-2, and rVR-2 protein or polypeptide can be 
synthesized chemically using standard peptide synthesis techniques. 

An "isolated" or "purified" protein or biologically active portion thereof is 
substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the hVR-1, hVR-2, and rVR-2 protein is derived, or 

20 substantially free from chemical precursors or other chemicals when chemically 

synthesized. The language "substantially free of cellular material" includes preparations 
of hVR-1, hVR-2, and rVR-2 protein in which the protein is separated from cellular 
components of the cells from which it is isolated or recombinantly produced. In one 
embodiment, the language "substantially free of cellular material" includes preparations 

25 of hVR-1, hVR-2, and rVR-2 protein having less than about 30% (by dry weight) of 
non-hVR-1, hVR-2, and rVR-2 protein (also referred to herein as a "contaminating 
protein"), more preferably Jess than about 20% of non-hVR-1, hVR-2, and rVR-2 
protein, still more preferably less than about 10% of non-hVR-1, hVR-2, and rVR-2 
protein, and most preferably less than about 5% non-hVR-1, non-hVR-2, and non-rVR-2 

30 protein. When the hVR-1, hVR-2, and rVR-2 protein or biologically active portion 
thereof is recombinantly produced, it is also preferably substantially free of culture 
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medium, i.e., culture medium represents less than about 20%, more preferably less than 
about 10%, and most preferably less than about 5% of the volume of the protein 
preparation. 

The language "substantially free of chemical precursors or other chemicals" 
5 includes preparations of hVR-1 , hVR-2, and rVR-2 protein in which the protein is 
separated from chemical precursors or other chemicals which are involved in the 
synthesis of the protein. In one embodiment, the language "substantially free of 
chemical precursors or other chemicals" includes preparations of hVR-1, hVR-2, and 
rVR-2 protein having less than about 30% (by dry weight) of chemical precursors or 
1 0 non-h VR- 1 , hVR-2, and rVR-2 chemicals, more preferably less than about 20% 

chemical precursors or non-hVR-1, hVR-2, and rVR-2 chemicals, still more preferably 
less than about 10% chemical precursors or non-hVR-L hVR-2, and rVR-2 chemicals, 
and most preferably less than about 5% chemical precursors or non-hVR-1, hVR-2, and 
rVR-2 chemicals. 

1 5 As used herein, a "biologically active portion" of an hVR- 1 , hVR-2, and rVR-2 

protein includes a fragment of an hVR-1, hVR-2, and rVR-2 protein which participates 
in an interaction between an hVR-1 , hVR-2, and rVR-2 molecule and a non-h VR-1 , 
non-hVR-2, and non-rVR-2 molecule, respectively. Biologically active portions of an 
hVR-1, hVR-2, and rVR-2 protein include peptides comprising amino acid sequences 

20 sufficiently homologous to or derived from the amino acid sequence of the hVR-1 , hVR- 
2, and rVR-2 protein, e.g., the amino acid sequence shown in SEQ ID NO:2, 5, 8, or 1 1, 
which include less amino acids than the full length hVR-1, hVR-2, and rVR-2 proteins, 
and exhibit at least one activity of an hVR-1 , hVR-2, and rVR-2 protein. Typically, 
biologically active portions comprise a domain or motif with at least one activity of the 

25 hVR- 1 , hVR-2, and rVR-2 protein, e.g. , binding of an hVR- 1 , hVR-2, and rVR-2 ligand 
such as a vanilloid compound, e.g., Capsaicin. A biologically active portion of an hVR- 
1 , hVR-2, and rVR-2 protein can be a polypeptide which is, for example, 1 0, 20, 30, 40, 
50, 60, 70, 80, 90, 100, 200 or more amino acids in length. Biologically active portions 
of an hVR-1, hVR-2, and rVR-2 protein can be used as targets for developing agents 

30 which modulate an hVR- 1 , hVR-2, and rVR-2 mediated activity, e.g. , a pain signaling 
mechanism. 
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In one embodiment, a biologically active portion of an hVR-1, hVR-2, and rVR- 
2 protein comprises at least one transmembrane domain, and/or at least one proline rich 
domain, and/or at least one ankyrin repeat domain. It is to be understood that a 
biologically active portion of an hVR-1, hVR-2, and rVR-2 protein of the present 
5 invention may contain at least one of the above-identified structural domains. A more 
biologically active portion of an hVR-1, hVR-2, and rVR-2 protein may contain at least 
two of the above-identified structural domains. Moreover, other biologically active 
portions, in which other regions of the protein are deleted, can be prepared by 
recombinant techniques and evaluated for one or more of the functional activities of a 

1 0 native hVR- 1 f hVR-2, and rVR-2 protein. 

In a embodiment, the hVR-1, hVR-2. and rVR-2 protein has an amino acid 
sequence shown in SEQ ID NO:2, 5, 8, or 1 1. In other embodiments, the hVR-1, hVR- 
2, and rVR-2 protein is substantially homologous to SEQ ID NO:2, 5, 8, or 1 1, and 
retains the functional activity of the protein of SEQ ID NO:2, 5, 8, or 1 1, yet differs in 

1 5 amino acid sequence due to natural allelic variation or mutagenesis, as described in 
detail in subsection I above. Accordingly, in another embodiment, the hVR-1, hVR-2, 
and rVR-2 protein is a protein which comprises an amino acid sequence at least about 
60%, 65% 70%, 75%, 80%, 85%, 87%, 90%, 95%, 98% or more homologous to SEQ 
ID NO:2, 5, 8, or 11. 

20 To determine the percent identity of two amino acid sequences or of two nucleic 

acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps 
can be introduced in one or both of a first and a second amino acid or nucleic acid 
sequence for optimal alignment and non-homologous sequences can be disregarded for 
comparison purposes). In a embodiment, the length of a reference sequence aligned for 

25 comparison purposes is at least 30%, preferably at least 40%, more preferably at least 
50%, even more preferably at least 60%, and even more preferably at least 70%, 80%, or 
90% of the length of the reference sequence (e.g., when aligning a second sequence to 
the hVR-1, hVR-2, and rVR-2 amino acid sequence of SEQ ID NO:2, 5, 8, or 1 1, having 
177 amino acid residues, at least 80, preferably at least 100, more preferably at least 120, 

30 even more preferably at least 140, and even more preferably at least 1 50, 1 60 or 1 70 
amino acid residues are aligned). The amino acid residues or nucleotides at 
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corresponding amino acid positions or nucleotide positions are then compared. When a 
position in the first sequence is occupied by the same amino acid residue or nucleotide 
as the corresponding position in the second sequence, then the molecules are identical at 
that position (as used herein amino acid or nucleic acid "identity" is equivalent to amino 
acid or nucleic acid "homology"). The percent identity between the two sequences is a 
function of the number of identical positions shared by the sequences, taking into 
account the number of gaps, and the length of each gap, which need to be introduced for 
optimal alignment of the two sequences. 

The comparison of sequences and determination of percent identity between two 
sequences can be accomplished using a mathematical algorithm. In a embodiment, the 
percent identity between two amino acid sequences is determined using the Needleman 
and Wunsch (./ Mol Biol (48):444-453 (1970)) algorithm which has been incorporated 
into the GAP program in the GCG software package (available at http://www.gcg.com), 
using either a Blossum 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 
10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet another embodiment, the 
percent identity between two nucleotide sequences is determined using the GAP 
program in the GCG software package (available at http://www.gcg.com), using a 
NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and a length weight 
of 1 , 2, 3, 4, 5, or 6. In another embodiment, the percent identity between two amino 
acid or nucleotide sequences is determined using the algorithm of E. Meyers and W. 
Miller (CABIOS, 4:1 1-17 (1989)) which has been incorporated into the ALIGN program 
(version 2.0), using a PAM120 weight residue table, a gap length penalty of 12 and a 
gap penalty of 4. 

The nucleic acid and protein sequences of the present invention can further be 
used as a "query sequence" to perform a search against public databases to, for example, 
identify other family members or related sequences. Such searches can be performed 
using the NBLAST and XBLAST programs (version 2.0) of Altschul, et al (1990) J. 
Mol Biol 215:403-10. BLAST nucleotide searches can be performed with the 
NBLAST program, score = 100, wordlength = 12 to obtain nucleotide sequences 
homologous to hVR-1, hVR-2, and rVR-2 nucleic acid molecules of the invention. 
BLAST protein searches can be performed with the XBLAST program, score = 50, 
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wordlength = 3 to obtain amino acid sequences homologous to hVR-1, hVR-2, and rVR- 
2 protein molecules of the invention. To obtain gapped alignments for comparison 
purposes. Gapped BLAST can be utilized as described in Altschul et al, (1997) Nucleic 
Acids Res. 25(1 7):3389-3402. When utilizing BLAST and Gapped BLAST programs, 
5 the default parameters of the respective programs (e.g., XBLAST and NBLAST) can be 
used. See http://www.ncbi.nlm.nih.gov. 

The invention also provides hVR-1, hVR-2, and rVR-2 chimeric or fusion 
proteins. As used herein, an hVR-1, hVR-2, and rVR-2 "chimeric protein" or "fusion 
protein" comprises an hVR-1, hVR-2, and rVR-2 polypeptide operatively linked to a 

1 0 non-hVR- 1 , hVR-2, and rVR-2 polypeptide. An M hVR- 1 , hVR-2, and rVR-2 

polypeptide" refers to a polypeptide having an amino acid sequence corresponding to 
hVR-1, hVR-2, and rVR-2, whereas a n non-hVR-l, non-hVR-2, and non-rVR-2 
polypeptide" refers to a polypeptide having an amino acid sequence corresponding to a 
protein which is not substantially homologous to the hVR-l, hVR-2, and rVR-2 protein, 

15 e.g., a protein which is different from the hVR-1, hVR-2, and rVR-2 protein and which 
is derived from the same or a different organism. Within an hVR-1 , hVR-2, and rVR-2 
fusion protein the hVR-1 , hVR-2, and rVR-2 polypeptide can correspond to all or a 
portion of an hVR-1, hVR-2, and rVR-2 protein. In a embodiment, an hVR-1, hVR-2, 
and rVR-2 fusion protein comprises at least one biologically active portion of an hVR-1, 

20 hVR-2, and rVR-2 protein. In another embodiment an hVR-1, hVR-2, and rVR-2 
fusion protein comprises at least two biologically active portions of an hVR-1, hVR-2, 
and rVR-2 protein. Within the fusion protein, the term "operatively linked" is intended 
to indicate that the hVR-1, hVR-2, and rVR-2 polypeptide and the non-hVR-1, non- 
hVR-2, and non-rVR-2 polypeptide are fused in-frame to each other. The non-hVR-1, 

25 hVR-2, and rVR-2 polypeptide can be fused to the N-terminus or C-terminus of the 
hVR-1 , hVR-2, and rVR-2 polypeptide. 

For example, in one embodiment, the fusion protein is a GST-hVR-1, GST- 
hVR-2, and GST-rVR-2 fusion protein in which the hVR-1, hVR-2, and rVR-2 
sequences are fused to the C-terminus of the GST sequences. Such fusion proteins can 

30 facilitate the purification of recombinant hVR-1, hVR-2, and rVR-2. 
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In another embodiment, the fusion protein is an hVR-1, hVR-2. and rVR-2 
protein containing a heterologous signal sequence at its N-terminus. In certain host cells 
(e.g., mammalian host cells), expression and/or secretion of hVR-1, hVR-2, and rVR-2 
can be increased through use of a heterologous signal sequence. 
5 The hVR-1 , hVR-2, and rVR-2 fusion proteins of the invention can be 

incorporated into pharmaceutical compositions and administered to a subject in vivo. 
The hVR-l, hVR-2, and rVR-2 fusion proteins can be used to affect the bioavailability 
of an hVR-1, hVR-2, and rVR-2 substrate. Use of hVR-1, hVR-2, and rVR-2 fusion 
proteins may be useful therapeutically for the treatment of disorders caused by, for 

1 0 example, (i) aberrant modification or mutation of a gene encoding an hVR- 1 , hVR-2, 
and rVR-2 protein; (ii) mis-regulation of the hVR-h hVR-2, and rVR-2 gene; and (iii) 
aberrant post-translational modification of an hVR-1, hVR-2, and rVR-2 protein. 

Moreover, the hVR-1, hVR-2, and rVR-2-fusion proteins of the invention can be 
used as immunogens to produce anti-hVR-1. anti-hVR-2, and anti-rVR-2 antibodies in a 

1 5 subject, to purify hVR-1 , hVR-2, and rVR-2 ligands and in screening assays to identify 
molecules which inhibit the interaction of hVR-1, hVR-2. and rVR-2 with an hVR-1, 
hVR-2, and rVR-2 substrate. 

Preferably, an hVR-1, hVR-2, and rVR-2 chimeric or fusion protein of the 
invention is produced by standard recombinant DNA techniques. For example, DNA 

20 fragments coding for the different polypeptide sequences are ligated together in-frame in 
accordance with conventional techniques, for example by employing blunt-ended or 
stagger-ended termini for ligation, restriction enzyme digestion to provide for 
appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase 
treatment to avoid undesirable joining, and enzymatic ligation. In another embodiment, 

25 the fusion gene can be synthesized by conventional techniques including automated 
DNA synthesizers. Alternatively, PCR amplification of gene fragments can be carried 
out using anchor primers which give rise to complementary overhangs between two 
consecutive gene fragments which can subsequently be annealed and reamplified to 
generate a chimeric gene sequence (see, for example, Current Protocols in Molecular 

30 Biology, eds. Ausubel et al. John Wiley & Sons: 1992). Moreover, many expression 
vectors are commercially available that already encode a fusion moiety (e.g., a GST 
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polypeptide). An hVR-1, hVR-2, and rVR-2 -encoding nucleic acid can be cloned into 
such an expression vector such that the fusion moiety is linked in-frame to the hVR-1, 
hVR-2, and rVR-2 protein. 

The present invention also pertains to variants of the hVR-I, hVR-2, and rVR-2 
5 proteins which function as either hVR-1 , hVR-2, and rVR-2 agonists (rnimetics) or as 
hVR-1, hVR-2, and rVR-2 antagonists. Variants of the hVR-1, hVR-2 ; and rVR-2 
proteins can be generated by mutagenesis, e.g. y discrete point mutation or truncation of 
an hVR-1, hVR-2, and rVR-2 protein. An agonist of the hVR-1, hVR-2, and rVR-2 
proteins can retain substantially the same, or a subset, of the biological activities of the 

10 . naturally occurring form of an hVR-1 , hVR-2, and rVR-2 protein. An antagonist of an 
hVR-1, hVR-2, and rVR-2 protein can inhibit one or more of the activities of the 
naturally occurring form of the hVR-1, hVR-2, and rVR-2 protein by ; for example, 
competitively modulating an hVR-1, hVR-2, and rVR-2-mediated activity of an hVR-1, 
hVR-2, and rVR-2 protein. Thus, specific biological effects can be elicited by treatment 

15 with a variant of limited function. In one embodiment, treatment of a subject with a 
variant having a subset of the biological activities of the naturally occurring form of the 
protein has fewer side effects in a subject relative to treatment with the naturally 
occurring form of the hVR-1, hVR-2, and rVR-2 protein. 

In one embodiment, variants of an hVR-1, hVR-2, and rVR-2 protein which 

20 function as either hVR- 1 , hVR-2, and rVR-2 agonists (rnimetics) or as h VR- 1 , hVR-2, 
and rVR-2 antagonists can be identified by screening combinatorial libraries of mutants, 
e.&, truncation mutants, of an hVR-1, hVR-2, and rVR-2 protein for hVR-1, hVR-2, and 
rVR-2 protein agonist or antagonist activity. In one embodiment, a variegated library of 
hVR-1, hVR-2, and rVR-2 variants is generated by combinatorial mutagenesis at the 

25 nucleic acid level and is encoded by a variegated gene library. A variegated library of 
hVR-1 , hVR-2, and rVR-2 variants can be produced by, for example, enzymatically 
ligating a mixture of synthetic oligonucleotides into gene sequences such that a 
degenerate set of potential hVR-1, hVR-2, and rVR-2 sequences is expressible as 
individual polypeptides, or alternatively, as a set of larger fusion proteins (e.g., for phage 

30 display) containing the set of hVR-1, hVR-2, and rVR-2 sequences therein. There are a 
variety of methods which can be used to produce libraries of potential hVR-1, hVR-2, 
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and rVR-2 variants from a degenerate oligonucleotide sequence. Chemical synthesis of 
a degenerate gene sequence can be performed in an automatic DNA synthesizer, and the 
synthetic gene then ligated into an appropriate expression vector. Use of a degenerate 
set of genes allows for the provision, in one mixture, of all of the sequences encoding 
5 the desired set of potential hVR- 1 , hVR-2, and rVR-2 sequences. Methods for 

synthesizing degenerate oligonucleotides are known in the art (see, e.g., Narang, S.A. 
(1983) Tetrahedron 39:3; Itakura et al (1984) Annu. Rev. Biochem. 53:323; Itakura et 
al. (WW) Science 198:1056; Ike eial (1983) Nucleic Acid Res. 11:477. 

In addition, libraries of fragments of an hVR-1, hVR-2, and rVR-2 protein 

10 coding sequence can be used to generate a variegated population of hVR-1, hVR-2, and 
rVR-2 fragments for screening and subsequent selection of variants of an hVR-1, hVR- 
2, and rVR-2 protein. In one embodiment, a library of coding sequence fragments can 
be generated by treating a double stranded PCR fragment of an hVR-1, hVR-2, and 
rVR-2 coding sequence with a nuclease under conditions wherein nicking occurs only 

15 about once per molecule, denaturing the double stranded DNA, renaturing the DNA to 
form double stranded DNA which can include sense/antisense pairs from different 
nicked products, removing single stranded portions from reformed duplexes by 
treatment with SI nuclease, and ligating the resulting fragment library into an expression 
vector. By this method, an expression library can be derived which encodes N-terminal, 

20 C-terminal and internal fragments of various sizes of the hVR-1 , hVR-2, and rVR-2 
protein. 

Several techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. Such techniques are adaptable for 

25 rapid screening of the gene libraries generated by the combinatorial mutagenesis of 
hVR-1, hVR-2, and rVR-2 proteins. The most widely used techniques, which are 
amenable to high through-put analysis, for screening large gene libraries typically 
include cloning the gene library into replicable expression vectors, transforming 
appropriate cells with the resulting library of vectors, and expressing the combinatorial 

30 genes under conditions in which detection of a desired activity facilitates isolation of the 
vector encoding the gene whose product was detected. Recrusive ensemble mutagenesis 
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(REM), a new technique which enhances the frequency of functional mutants in the 
libraries, can be used in combination with the screening assays to identify hVR-1, hVR- 
2, and rVR-2 variants (Arkin and Yourvan (1992) Proc. Natl. Acad. ScL USA 59:781 1- 
7815; Delgrave et al (1993) Protein Engineering 6(3):327-331). 
5 In one embodiment, cell based assays can be exploited to analyze a variegated 

hVR-1 ? hVR-2, and rVR-2 library. For example, a library of expression vectors can be 
transfected into a cell line, e.g., a neuronal cell line, which ordinarily responds to a 
particular ligand in an hVR-1, hVR-2, and rVR-2-dependent manner. The transfected 
cells are then contacted with the ligand and the effect of expression of the mutant on 

10 signaling by the ligand can be detected, e.g., by measuring intracellular calcium 

concentration, neuronal membrane depolarization, or the activity of an hVR-1, hVR-2 9 
and rVR-2-regulated transcription factor. Plasmid DNA can then be recovered from the 
cells which score for inhibition, or alternatively, potentiation of signaling by the ligand, 
and the individual clones further characterized. 

15 An isolated hVR-1 , hVR-2, and rVR-2 protein, or a portion or fragment thereof, 

can be used as an immunogen to generate antibodies that bind hVR-1, hVR-2, and rVR- 
2 using standard techniques for polyclonal and monoclonal antibody preparation. A 
full-length hVR-1, hVR-2, and rVR-2 protein can be used or, alternatively, the invention 
provides antigenic peptide fragments of hVR-1, hVR-2, and rVR-2 for use as 

20 immunogens. The antigenic peptide of hVR-1 , hVR-2, and rVR-2 comprises at least 8 
amino acid residues of the amino acid sequence shown in SEQ ID NO:2, 5, 8, or 1 1 and 
encompasses an epitope of hVR-1, hVR-2, and rVR-2 such that an antibody raised 
against the peptide forms a specific immune complex with hVR-1, hVR-2, and rVR-2. 
Preferably, the antigenic peptide comprises at least 10 amino acid residues, more 

25 preferably at least 1 5 amino acid residues, even more preferably at least 20 amino acid 
residues, and most preferably at least 30 amino acid residues. 

Epitopes encompassed by the antigenic peptide are regions of hVR-1, hVR-2, 
and rVR-2 that are located on the surface of the protein, e.g., hydrophilic regions, as 
well as regions with high antigenicity (see, for example, Figures 12 and 14). 
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by mazing a suitable subject. (,*., rabbi,, goat , mo|Ise „ ^ mamma|) ^ fc 
■mmunogen. An appropriate immunogenic CM for 

recombinandy expressed hVR- ,, hVR-2, m d , V R. 2 pr01 e in or , chemkslly ^ 
5 »VR-l,hVR.2,andrVR-2po,ypepdde. The p.para.ion can fml be r indude an 
adjuvant such as Freund's complete or incomplete adjuvant, or state 
immunosdmttlatory agent, tanunizadon of a suable subject with an immunogenic 
hVR-I, hVR-2, and ,VR-2 preparation indues a polyclonal ami-hVR-1, and-hVR-? 
and anti-rVR-2 antibody response. 
10 Accordingly, another aspect of the invention pertains to ami-hVR-1 anti-hVR 2 

and anti-rVR-2 antibodies. The term "antibody" as used herein refers to 
.mmunoglobulin molecules and immunologically active portions of immunoglobulin 
molecules, molecu.es that contain an antigen binding site which specifically binds 
(unmunoreacts with) an antigen, such as hVR-1 , hVR-2, and rVR-2. Examples of 
15 analogically active portions of immunoglobulin molecules include F( ab ) and 

F(ab') 2 fragments which can be generated by treating the antibody with an enzyme such 
as pepsm. The invention provides polyclonal and monoclonal antibodies that bind hVR- 
1, hVR-2, and rVR-2. The term "monoclonal antibody" or "monoclonal antibody 
composition", as used herein, refers to a population of antibody modules that contain 
20 only one species of an antigen binding site capable of immunoreacting with a particular 
epitope of hVR-,, hV R-2, and rVR-2. A monoclonal antibody composition thus 
really displays a single binding affinity for a particular hVR-,, hVR-2, and rVR-2 
protein with which it immunoreacts. 

as descnbed above by immunizing a suable subject with an hVR-1, hVR-2 and rVR 2 
■mmunogen. The and-hVR-!, anti-hVR-2, and anti-rVR.2 antibody titer in the 

enzyme linked immunosorbent assay (ELISA) using immobilized hVR-1 hVR-2 and 
rVR-2. If desired, the andbody molecules directed against hVR.,, hVR-2, and rVR-2 
can be isolated from me mamma, (e.g., ftom the blood) and fcrther purified by well 
known techniques, such as protein A chromatography to obtain the IgG fraction. At an 
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appropriate time after immunization, e.g., when the anti-hVR-1 , anti-hVR-2, and anti- 
rVR-2 antibody titers are highest antibody-producing cells can be obtained from the 
subject and used to prepare monoclonal antibodies by standard techniques, such as the 
hybridoma technique originally described by Kohler and Milstein (1975) Nature 
5 256:495-497) (see also, Brown et at. (1981)J. Immunol 127:539-46; Brown et al 
(1980) J. Biol. Chem .255:4980-83; Yeh et at. (1976) Proc. Natl Acad. Sci. USA 
76:2927-3 1 ; and Yeh et al (1 982) Int. J. Cancer 29:269-75), the more recent human B 
* cell hybridoma technique (Kozbor et al (1 983) Immunol Today 4:72), the EBV- 
hybridoma technique (Cole et al (1985), Monoclonal Antibodies and Cancer Therapy, 

10 Alan R. Liss, Inc., pp. 77-96) or trioma techniques. The technology for producing 
monoclonal antibody hybridomas is well known (see generally R. H. Kenneth, in 
Monoclonal Antibodies: A New Dimension In Biological Analyses, Plenum Publishing 
Corp., New York, New York (1980); E. A. Lerner (1981) Yale J. Biol Med t 54:387402; 
M. L. Gefter et al (1977) Somatic Cell Genet. 3:23136). Briefly, an immortal cell line 

15 (typically a myeloma) is fused to lymphocytes (typically splenocytes) from a mammal 
immunized with an hVR-1 , hVR-2, and rVR-2 immunogen as described above, and the 
culture supernatants of the resulting hybridoma cells are screened to identify a 
hybridoma producing a monoclonal antibody that binds hVR-1, hVR-2, and rVR-2. 
Any of the many well known protocols used for fusing lymphocytes and 

20 immortalized cell lines can be applied for the purpose of generating an anti-hVR-1, anti- 
hVR-2, and anti-rVR-2 monoclonal antibodies (see, e.g., G. Galfre et al (1977) Nature 
266:55052; Gefter et al Somatic Cell Genet, cited supra; Lerner, Vale J. Biol Med, 
cited supra; Kenneth, Monoclonal Antibodies, cited supra). Moreover, the ordinarily 
skilled worker will appreciate that there are many variations of such methods which also 

25 would be useful. Typically, the immortal cell line {e.g., a myeloma cell line) is derived 
from the same mammalian species as the lymphocytes. For example, murine 
hybridomas can be made by fusing lymphocytes from a mouse immunized with an 
immunogenic preparation of the present invention with an immortalized mouse cell line, 
immortal cell lines are mouse myeloma cell lines that are sensitive to culture medium 

30 containing hypoxanthine, aminopterin and thymidine ("HAT medium"). Any of a 
number of myeloma cell lines can be used as a ftision partner according to standard 
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.... WSM^,, M . x « 3 -A g8 .653 0 rS P 2/O.A 8 ,4 myeloma lilKs . 

656 mye ' 0m * ' ineS " avai,able *™ ATCC. Typically, HAT-sansiliva mouse 
myeloma calls arc fijsad to mousa splenocytes ^ 8|ycoJ ( , pEG . ) 

Hybridoma cells resulting fa. 4a tai „„ „ „,„ ^ ^ ^ ^ ^ 
5 talk unfcsad and mprod ucUvaly iused myeloma calls (unfaed splays die ate 
several days because they are noi transformed). Hybridoma calls pacing a 
monoclonal antibody of the invention „ detKled by fc ^ 

supe™ fo, andbodies fa. bind hVR-1, „VR-2, and rVR-2, using , ^ 

ELISA assay. 

10 Alternative to preparing monoclonal antibody-secreting hybridomas a 

monoclonal anti-hVR- , , anti-hVR-2, and anti-rVR-2 antibody can be identified and 
.solated by screening a recombinant combinatorial immunoglobulin library (e g. an 
antibody phage display library) with hVR-1, hVR-2, and rVR-2 to thereby isolate 
immunoglobulin library members that bind hVR-1, hVR-2, and rVR-2 Kits for 
15 generating and screening phage display libraries are commercially available (e g. the 
Pharmacia Recombinant Phage Antibody System, Catalog No. 27-9400-0 1 • and the 
Stratagene SurJZAP™ Phage Display Kit, Cafclog No. 240612). Additionally, examples 
of methods and reagents particularly amenable for use in generating and screening 
anybody display library can be found in, for example, Ladner */. U.S. Patent No 
20 5,223,409: Kang al PCX International Publication No. WO 92/1 861 9: Dower et al. 
PCT International Publication No. WO 91/17271; Winter et al. PCT International 
Publication WO 92/20791 ; Markland et al. PCT International Publication No WO 
92/15679; Breitling et al. PCT International Publication WO 93/01288; McCafferty et 
al. PCT International Publication No. WO 92/01 047; Ganard et al. PCT International 
25 ^on No. WO 92^ wq 

90/02809; Fuchs**/. (1991) Bio/Technology 9:1370-1372; H & y et al. (1992)*™ 
An ^^omas3:SUB5;Hus^ 

0993)^12:725-734; Hawkins (1992)7. Mol. Biol. 226:889-896; Clarke 
etal. (1991) Nature 352:624-628; Gram etal. (1992) Proc. Natl. Acad Sci USA 
30 89:3576-3580;Garrade, O /. (1991)^^9:1373-1377; Hoogenboom * «/ 
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(1991) Nuc. Acid Res. 19:4133-4137; Barbas etal. (1991) Proc. Natl. Acad. Sci. USA 
88:7978-7982; and McCafferty et al Nature (1990) 348:552-554. 

Additionally, recombinant anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibodies, 
such as chimeric and humanized monoclonal antibodies, comprising both human and 
5 non-human portions, which can be made using standard recombinant DNA techniques, 
are within the scope of the invention. Such chimeric and humanized monoclonal 
antibodies can be produced by recombinant DNA techniques known in the art, for 
example using methods described in Robinson et al International Application No. 
PCT/US86/02269; Akira, etal. European Patent Application 184,187; Taniguchi, M„ 
0 European Patent Application 1 71 ,496; Morrison et al European Patent Application 1 
173,494; Neuberger et al. PCT International Publication No. WO 86/01533; Cabilly et 
al U.S. Patent No. 4,816,567; Cabilly et al. European Patent Application 125,023; 
Better et al. (1988) Science 240:1041-1043; Liu etal (1987) Proc. Natl Acad. Sci. USA 
84:3439-3443; Liu et al (1987) J. Immunol 139:3521-3526; Sun et al (1987) Proc. 
5 Natl Acad ScL USA 84:214-218; Nishimura et al. (1987) Cane. Res. 47:999-1005; 1 
Wood etal (1985) Nature 314:446-449; and Shaw et al (1988) J. Afa//. Cancer Inst. 
80:1553-1559); Morrison, S. L. (1985) Science 229:1202-1207; Oi et al (1986) 
BioTechniques 4:214; Winter U.S. Patent 5,225,539; Jones et al. (1986) Nature 
321:552-525; Verhoeyan et al (1988) Science 239:1534; and Beidler et al (1988)7. 
0 Immunol 141:4053-4060. 2 
An anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibody (e.g., monoclonal 
antibody) can be used to isolate hVR-1, hVR-2, and rVR-2 by standard techniques, such 
as affinity chromatography or immunoprecipitation. An anti-hVR-1, anti-hVR-2, and 
anti-rVR-2 antibody can facilitate the purification of natural hVR-1, hVR-2, and rVR-2 
5 from cells and of recombinantly produced hVR- 1 , hVR-2, and rVR-2 expressed in host 2 
cells. Moreover, an anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibody can be used to 
detect hVR-1 , hVR-2, and rVR-2 protein (e.g., in a cellular lysate or cell supernatant) in 
order to evaluate the abundance and pattern of expression of the hVR-1 , hVR-2, and 
rVR-2 protein. Anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibodies can be used 
0 diagnostically to monitor protein levels in tissue as part of a clinical testing procedure, 3> 
e.g., to, for example, determine the efficacy of a given treatment regimen. Detection can 
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be facilitated by coupling (/.<?., physically linking) the antibody to a detectable 
substance. Examples of detectable substances include various enzymes, prosthetic 
groups, fluorescent materials, luminescent materials, bioluminescent materials, and 
radioactive materials. Examples of suitable enzymes include horseradish peroxidase, 
5 alkaline phosphatase, -galactosidase, or acetylcholinesterase; examples of suitable 
prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples of 
suitable fluorescent materials include umbelliferone, fluorescein, fluorescein 
isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or 
phycoerythrin; an example of a luminescent material includes luminol; examples of 
10 bioluminescent materials include luciferase, luciferin, and aequorin, and examples of 
suitable radioactive material include 125 I, 13 35 S or 3 H. 

III. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression 

1 5 vectors, containing a nucleic acid encoding an h VR- 1 , hVR-2, and rVR-2 protein (or a 
portion thereof). As used herein, the term "vector" refers to a nucleic acid molecule 
capable of transporting another nucleic acid to which it has been linked. One type of 
vector is a "plasmid", which refers to a circular double stranded DNA loop into which 
additional DNA segments can be ligated. Another type of vector is a viral vector, 

20 wherein additional DNA segments can be ligated into the viral genome. Certain vectors 
are capable of autonomous replication in a host cell into which they are introduced (e.g., 
bacterial vectors having a bacterial origin of replication and episomal mammalian 
vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the 
genome of a host cell upon introduction into the host cell, and thereby are replicated 

25 along with the host genome. Moreover, certain vectors are capable of directing the 
expression of genes to which they are operatively linked. Such vectors are referred to 
herein as "expression vectors". In general, expression vectors of utility in recombinant 
DNA techniques are often in the form of plasmids. In the present specification, 
"plasmid" and "vector" can be used interchangeably as the plasmid is the most 

30 commonly used form of vector. However, the invention is intended to include such 
other forms of expression vectors, such as viral vectors (e.g., replication defective 
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retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent 
functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of 
the invention in a form suitable for expression of the nucleic acid in a host cell, which 
means that the recombinant expression vectors include one or more regulatory 
sequences, selected on the basis of the host cells to be used for expression, which is 
operatively linked to the nucleic acid sequence to be expressed. Within a recombinant 
expression vector, "operably linked" is intended to mean that the nucleotide sequence of 
interest is linked to the regulatory sequence(s) in a manner which allows for expression 
of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a 
host cell when the vector is introduced into the host cell). The term "regulatory 
sequence" is intended to include promoters, enhancers and other expression control 
elements (e.g., polyadenylation signals). Such regulatory sequences are described, for 
example, in Goeddel; Gene Expression Technology: Methods in Enzymology 1 85, 
Academic Press, San Diego, CA (1990). Regulatory sequences include those which 
direct constitutive expression of a nucleotide sequence in many types of host cells and 
those which direct expression of the nucleotide sequence only in certain host cells (e.g. , 
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art 
that the design of the expression vector can depend on such factors as the choice of the 
host cell to be transformed, the level of expression of protein desired, and the like. The 
expression vectors of the invention can be introduced into host cells to thereby produce 
proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as 
described herein (e.g., hVR-1, hVR-2, and rVR-2 proteins, mutant forms of hVR-1, 
hVR-2, and rVR-2 proteins, fusion proteins, and the like). 

The recombinant expression vectors of the invention can be designed for 
expression of hVR-1, hVR-2, and rVR-2 proteins in prokaryotic or eukaryotic cells. For 
example, hVR-1, hVR-2, and rVR-2 proteins can be expressed in bacterial cells such as 
£. coli, insect cells (using baculovirus expression vectors) yeast cells or mammalian 
cells. Suitable host cells are discussed further in Goeddel, Gene Expression Technology: 
Methods in Enzymology 185, Academic Press, San Diego, CA (1990). Alternatively, the 
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recombinant expression vector can be transcribed and translated in vitro, for example 
using T7 promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out in E. coli with 
vectors containing constitutive or inducible promoters directing the expression of either 
5 fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion 
vectors typically serve three purposes: 1) to increase expression of recombinant protein; 
2) to increase the solubility of the recombinant protein; and 3) to aid in the purification 
of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion 

1 0 expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein from 
the fusion moiety subsequent to purification of the fusion protein. Such enzymes, and 
their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. 
Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith, D.B. 

1 5 and Johnson, K.S. ( 1 988) Gene 67:3 1 -40), pMAL (New England Biolabs, Beverly, MA) 
and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), 
maltose E binding protein, or protein A, respectively, to the target recombinant protein. 

Purified fusion proteins can be utilized in hVR-1, hVR-2, and rVR-2 activity 
assays, (e.g., direct assays or competitive assays described in detail below), or to, for 

20 example, generate antibodies specific for hVR-1, hVR-2, and rVR-2 proteins. In a 
embodiment, an hVR- 1 , hVR-2, and rVR-2 fusion protein expressed in a retroviral 
expression vector of the present invention can be utilized to infect bone marrow cells 
which are subsequently transplanted into irradiated recipients. The pathology of the 
subject recipient is then examined after sufficient time has passed (e.g., six (6) weeks). 

25 Examples of suitable inducible non-fusion E. coli expression vectors include 

pTrc (Amann et al, (1988) Gene 69:301-3 15) and pET 1 Id (Studier et al, Gene 
Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, 
California (1 990) 60-89). Target gene expression from the pTrc vector relies on host 
RNA polymerase transcription from a hybrid trp-lac fusion promoter. Target gene 

30 expression from the pET 1 Id vector relies on transcription from a T7 gnl 0-lac fusion 
promoter mediated by a coexpressed viral RNA polymerase (T7 gnl). This viral 
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polymerase is supplied by host strains BL21(DE3) or HMS174(DE3) from a resident 
prophage harboring a T7 gnl gene under the transcriptional control of the lacUV 5 
promoter. 

One strategy to maximize recombinant protein expression in E. coli is to express 
5 the protein in a host bacteria with an impaired capacity to proteolytically cleave the 
recombinant protein (Gottesman, S., Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, California (1990) 1 19-128). Another 
strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an 
expression vector so that the individual codons for each amino acid are those 
10 preferentially utilized in £ coli (Wada et al, (1992) Nucleic Acids Res. 20:21 1 1-2118). 
Such alteration of nucleic acid sequences of the invention can be carried out by standard 
DNA synthesis techniques. 

In another embodiment, the hVR-I, hVR-2, and rVR-2 expression vector is a 
yeast expression vector. Examples of vectors for expression in yeast S. cerivisae include 
1 5 pYepSec 1 (Baldari, et al , (1 987) Embo J. 6:229-234), pMFa (Kurjan and Herskowitz, 

(1982) Cell 30:933-943), pJRY88 (Schultz et al, (1987) Gene 54:1 13-123), pYES2 
(Invitrogen Corporation, San Diego, CA), and picZ (InVitrogen Corp, San Diego, CA). 

Alternatively, hVR-1 , hVR-2, and rVR-2 proteins can be expressed in insect 
cells using baculovirus expression vectors. Baculovirus vectors available for expression 
20 of proteins in cultured insect cells (e.g., Sf 9 cells) include the pAc series (Smith et al 

(1983) Mol Cell Biol 3:2156-2165) and the pVL series (Lucklow and Summers (1989) 
Virology 170:31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 

25 expression vectors include pCDM8 (Seed, B. (1 987) Nature 329:840) and pMT2PC 
(Kaufman et al (1987) EMBO J. 6:187-195). When used in mammalian cells, the 
expression vector's control functions are often provided by viral regulatory elements. 
For example, commonly used promoters are derived from polyoma, Adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 

30 prokaryotic and eukaiyotic cells see chapters 16 and 1 7 of Sambrook, J., Fritsh, E. F., 
and Maniatis, T. Molecular Cloning: A Laboratory Manual 2nd, ed. t Cold Spring 
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Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989. 

In another embodiment, the recombinant mammalian expression vector is 
capable of directing expression of the nucleic acid preferentially in a particular cell type 
5 (e.g. , tissue-specific regulatory elements are used to express the nucleic acid). Tissue- 
specific regulatory elements are known in the art. Non-limiting examples of suitable 
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al 
(1987) Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton (1988) 
Adv. Immunol. 43:235-275), in particular promoters of T cell receptors (Winoto and 

10 Baltimore (1989) EMBOJ. 8:729-733) and immunoglobulins (Banerji et al. (1983) Cell 
33:729-740; Queen and Baltimore (1983) Cell 33:741-748), neuron-specific promoters 
(e.g., the neurofilament promoter; Byrne and Ruddle (1989) Proc. Natl Acad ScL USA 
86:5473-5477), pancreas-specific promoters (Edlund et al. (1985) Science 230:912-916), 
and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 

1 5 4,873,3 1 6 and European Application Publication No. 264, 1 66). Developmentally- 
regulated promoters are also encompassed, for example the murine box promoters 
(Kessel and Gruss (1990) Science 249:374-379) and the ct-fetoprotein promoter 
(Campes and Tilghman (1989) Genes Dev. 3:537-546). 

The expression characteristics of an endogenous hVR-1, hVR-2, and rVR-2 gene 

20 within a cell line or microorganism may be modified by inserting a heterologous DNA 
regulatory element into the genome of a stable cell line or cloned microorganism such 
that the inserted regulatory element is operatively linked with the endogenous hVR-1, 
hVR-2, and rVR-2 gene. For example, an endogenous hVR-1, hVR-2, and rVR-2 gene 
which is normally "trancriptionally silent", Le. 9 a hVR-1, hVR-2, and rVR-2 gene which 

25 is normally not expressed, or is expressed only at very low levels in a cell line or 

microorganism, may be activated by inserting a regulatory element which is capable of 
promoting the expression of a normally expressed gene product in that cell line or 
microorganism. Alternatively, a transcriptionally silent, endogenous hVR-1, hVR-2, 
and rVR-2 gene, may be activated by insertion of a promiscuous regulatory element that 

30 works across cell types. 
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A heterologous regulatory element may be inserted into a stable cell line or 
cloned microorganism, such that it is operatively linked with an endogenous hVR-1, 
hVR-2, and rVR-2 gene, using techniques, such as targeted homologous recombination, 
which are well known to those of skill in the art, and described e.g., in Chappel, U.S. 
Patent No.: 5,272,071 ; PCT publication No. WO 91/06667, published May 16, 1991 . 

The invention further provides a recombinant expression vector comprising a 
DNA molecule of the invention cloned into the expression vector in an antisense 
orientation. That is, the DNA molecule is operatively linked to a regulatory sequence in 
a manner which allows for expression (by transcription of the DNA molecule) of an 
RNA molecule which is antisense to hVR-1, hVR-2, and rVR-2 mRNA. Regulatory 
sequences operatively linked to a nucleic acid cloned in the antisense orientation can be 
chosen which direct the continuous expression of the antisense RNA molecule in a 
variety of cell types, for instance viral promoters and/or enhancers, or regulatory 
sequences can be chosen which direct constitutive, tissue specific or cell type specific 
expression of antisense RNA. The antisense expression vector can be in the form of a 
recombinant plasmid, phagemid or attenuated virus in which antisense nucleic acids are 
produced under the control of a high efficiency regulatory region, the activity of which 
can be determined by the cell type into which the vector is introduced. For a discussion 
of the regulation of gene expression using antisense genes see Weintraub, H. et al, 
Antisense RNA as a molecular tool for genetic analysis, Reviews - Trends in Genetics, 
Vol 1(1) 1986. 

Another aspect of the invention pertains to host cells into which an hVR-1, hVR- 
2, and rVR-2 nucleic acid molecule of the invention is introduced, e.g. 9 an hVR-1, hVR- 
2, and rVR-2 nucleic acid molecule within a recombinant expression vector or an hVR- 
1, hVR-2, and rVR-2 nucleic acid molecule containing sequences which allow it to 
homologously recombine into a specific site of the host cell's genome. The terms "host 
cell" and "recombinant host ceil" are used interchangeably herein. It is understood that 
such terms refer not only to the particular subject cell but to the progeny or potential 
progeny of such a cell. Because certain modifications may occur in succeeding 
generations due to either mutation or environmental influences, such progeny may not, 
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in fact, be identical to the parent cell, but are still included within the scope of the term 
as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, an hVR-1, 
hVR-2, and rVR-2 protein can be expressed in bacterial cells such as E. coli> insect cells, 
5 yeast or mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells). 
Other suitable host cells are known to those skilled in the art. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. As used herein, the terms 
"transformation' 1 and "transfection" are intended to refer to a variety of art-recognized 

1 0 techniques for introducing foreign nucleic acid (e.g. , DNA) into a host cell, including 
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated 
transfection, lipofection, or electroporation. Suitable methods for transforming or 
transfecting host cells can be found in Sambrook, et al. {Molecular Cloning: A 
Laboratory Manual. 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring Harbor 

15 Laboratory Press, Cold Spring Harbor, NY, 1989), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is 

20 generally introduced into the host cells along with the gene of interest, selectable 

markers include those which confer resistance to drugs, such as G418, hygromycin and 
methotrexate. Nucleic acid encoding a selectable marker can be introduced into a host 
cell on the same vector as that encoding an hVR-1, hVR-2, and rVR-2 protein or can be 
introduced on a separate vector. Cells stably transfected with the introduced nucleic 

25 acid can be identified by drug selection (e.g. , cells that have incorporated the selectable 
marker gene will survive, while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce (/.e., express) an hVR-1, hVR-2, and rVR-2 protein. 
Accordingly, the invention further provides methods for producing an hVR-l, hVR-2, 

30 and rVR-2 protein using the host cells of the invention. In one embodiment, the method 
comprises culturing the host cell of invention (into which a recombinant expression 
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vector encoding an hVR-1, hVR-2, and rVR-2 protein has been introduced) in a suitable 
medium such that an hVR-1, hVR-2, and rVR-2 protein is produced. In another 
embodiment, the method further comprises isolating an hVR-1, hVR-2, and rVR-2 
protein from the medium or the host cell. 
5 The host cells of the invention can also be used to produce non-human transgenic 

animals. For example, in one embodiment, a host cell of the invention is a fertilized 
oocyte or an embryonic stem cell into which hVR-1 , hVR-2, and rVR-2-coding 
sequences have been introduced. Such host cells can then be used to create non-human 
transgenic animals in which exogenous hVR-1, hVR-2, and rVR-2 sequences have been 

10 introduced into their genome or homologous recombinant animals in which endogenous 
hVR-1, hVR-2, and rVR-2 sequences have been altered. Such animals are useful for 
studying the function and/or activity of an hVR-1, hVR-2, and rVR-2 and for identifying 
and/or evaluating modulators of hVR-1, hVR-2, and rVR-2 activity. As used herein, a 
"transgenic animal" is a non-human animal, preferably a mammal, more preferably a 

15 rodent such as a rat or mouse, in which one or more of the cells of the animal includes a 
transgene. Other examples of transgenic animals include non-human primates, sheep, 
dogs, cows, goats, chickens, amphibians, and the like. A transgene is exogenous DNA 
which is integrated into the genome of a cell from which a transgenic animal develops 
and which remains in the genome of the mature animal, thereby directing the expression 

20 of an encoded gene product in one or more cell types or tissues of the transgenic animal. 
As used herein, a "homologous recombinant animal" is a non-human animal, preferably 
a mammal, more preferably a mouse, in which an endogenous hVR-1, hVR-2, and rVR- 
2 gene has been altered by homologous recombination between the endogenous gene 
and an exogenous DNA molecule introduced into a cell of the animal, e.g., an 

25 embryonic cell of the animal, prior to development of the animal. 

A transgenic animal of the invention can be created by introducing an hVR-1, 
hVR-2, and rVR-2-encoding nucleic acid into the male pronuclei of a fertilized oocyte, 
e.g., by microinjection, retroviral infection, and allowing the oocyte to develop in a 
pseudopregnant female foster animal. The hVR-1, hVR-2, and rVR-2 cDNA sequence 

30 of SEQ ID NO: 1 , 3, 5, 7 or 9 can be introduced as a transgene into the genome of a non- 
human animal. Alternatively, a nonhuman homologue of a hVR-2 gene, such as a 
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mouseorrathVR-2,, g .,therVR-2gene,canbeusedasat r ans g ene. Alternatively an 
hVR- 1 , hVR-2, and rVR-2 gene homologue, such as another member of the 
Capsaicin/Vanilloid family, can be isolated based on hybridization to the hVR-1 hVR-2 
and rVR-2 cDNA sequences of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, (describedfurther ' 
5 m subsection I above) and used as a transgene. Intronic sequences and polyadenylation 
s,gnals can also be included in the transgene to increase the efficiency of expression of 
the transgene. A tissue-specific regulatory sequence(s) can be operably linked to an 
hVR-1, hVR-2, and rVR-2 transgene to direct expression of an hVR-1, hVR-2, and rVR- 
2 protein to particular cells. Methods for generating transgenic animals via embryo 
10 mampulation and microinjection, particularly animals such as mice, have become 

conventional in the art and are described, for example, in U.S. Patent Nos. 4,736,866 and 
4,870,009, both by Leder U.S. PatentNo. 4,873,191 by Wagner el al. and in 
Hogan, B., Manipulating the Mouse Embryo, (Cold Spring Harbor Laboratory Press 
Cold Spring Harbor, N.Y., 1 986). Similar methods are used for production of other ' 
15 transgemc animals. A transgenic founder animal can be identified based upon the 
presence of an hVR-1, hVR-2, and rVR-2 transgene in its genome and/or expression of 
hVR-I, hVR-2, and rVR-2 mRNA in tissues or cells of the animals. A transgenic 
founder animal can then be used to breed additional animals carrying the transgene. 
Moreover, transgenic animals carrying a transgene encoding an hVR-1, hVR-2 and 
20 rVR-2 protein can further be bred to other transgenic animals carrying other transgenes. 

To create a homologous recombinant animal, a vector is prepared which contains 
at least a portion of an hVR-I, hVR-2, and rVR-2 gene into which a deletion, addition or 
substitute has been introduced to thereby alter, e.g., functionally disrupt, the hVR-1 
hVR-2, and rVR-2 gene. The VR-1 or VR-2 gene can be a human gene (*,,, the cDNA 
25 ^EQmNO:l,3,5,4,6,7,or9)^ 

a hVR-1 and hVR-2 gene (e.g., the cDNA of SEQ ID NO:10 or 12, or a cDNA isolated 
by stringent hybridization with the nucleotide sequence of SEQ ID NO: 1, 3, 5, 4 , 6 , 7 
or 9). For example, a mouse VR-2 gene can be used to construct a homologous 
recombination nucleic acid molecule, e.g. , a vector, suitable for altering an endogenous 
30 VR-2 gene in the mouse genome. In a embodiment, the homologous recombination 
nucle.c acid molecule is designed such that, upon homologous recombination, the 
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endogenous hVR-1, hVR-2, and rVR-2 gene is functionally disrupted (/.e., no longer 
encodes a functional protein; also referred to as a "knock out" vector). Alternatively, the 
homologous recombination nucleic acid molecule can be designed such that, upon 
homologous recombination, the endogenous hVR-1, hVR-2, and rVR-2 gene is mutated 
or otherwise altered but still encodes functional protein (e.g., the upstream regulatory 
region can be altered to thereby alter the expression of the endogenous hVR-I, hVR-2, 
and rVR-2 protein). In the homologous recombination nucleic acid molecule, the altered 
portion of the hVR-1 , hVR-2, and rVR-2 gene is flanked at its 5' and 3' ends by 
additional nucleic acid sequence of the hVR-1, hVR-2, and rVR-2 gene to allow for 
homologous recombination to occur between the exogenous hVR-1, hVR-2, and rVR-2 
gene carried by the homologous recombination nucleic acid molecule and an 
endogenous hVR-1, hVR-2. and rVR-2 gene in a cell, e.g., an embryonic stem cell. The 
additional flanking hVR-1, hVR-2, and rVR-2 nucleic acid sequence is of sufficient 
length for successful homologous recombination with the endogenous gene. Typically, 
several kilobases of flanking DNA (both at the 5* and 3' ends) are included in the 
homologous recombination nucleic acid molecule (see, e.g., Thomas, K.R. and 
Capecchi, M. R. (1987) Cell 51:503 for a description of homologous recombination 
vectors). The homologous recombination nucleic acid molecule is introduced into a cell, 
e.g., an embryonic stem cell line (e.g., by electroporation) and cells in which the 
introduced hVR-l, hVR-2, and rVR-2 gene has homologously recombined with the 
endogenous hVR-1 , hVR-2, and rVR-2 gene are selected (see e.g. , Li, E. ei al (1 992) 
Cell 69:915). The selected cells can then injected into a blastocyst of an animal (e.g., a 
mouse) to form aggregation chimeras (see e.g., Bradley, A. in Teratocarcinomas and 
Embryonic Stem Cells: A Practical Approach^ E.J. Robertson, ed. (IRL, Oxford, 1987) 
pp. 1 13-152). A chimeric embryo can then be implanted into a suitable pseudopregnant 
female foster animal and the embryo brought to term. Progeny harboring the 
homologously recombined DNA in their germ cells can be used to breed animals in 
which all cells of the animal contain the homologously recombined DNA by germline 
transmission of the transgene. Methods for constructing homologous recombination 
nucleic acid molecules, e.g., vectors, or homologous recombinant animals are described 
further in Bradley, A. (1991) Current Opinion in Biotechnology 2:823-829 and in PCT 
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International Publication Nos.: WO 90/1 1354 by Le Mduellec et al\ WO 91/01 140 by 
Smithies et al\ WO 92/0968 by Zijlstra et al.; and WO 93/04169 by Berns et al 

In another embodiment, transgenic non-humans animals can be produced which 
contain selected systems which allow for regulated expression of the transgene. One 
example of such a system is the cre/loxP recombinase system of bacteriophage PI . For 
a description of the cre/loxP recombinase system, see, e.g., Lakso et al (1992) Proc. 
Natl. Acad ScL USA 89:6232-6236. Another example of a recombinase system is the 
FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al. (1991) Science 
251 :1351-1355. If a cre/loxP recombinase system is used to regulate expression of the 
transgene, animals containing transgenes encoding both the Cre recombinase and a 
selected protein are required. Such animals can be provided through the construction of 
"double" transgenic animals, e.g., by mating two transgenic animals, one containing a 
transgene encoding a selected protein and the other containing a transgene encoding a 
recombinase. 

Clones of the non-human transgenic animals described herein can also be 
produced according to the methods described in Wilmut, I. et al. (1 997) Nature 385:810- 
813 and PCT International Publication Nos. WO 97/07668 and WO 97/07669. In brief, 
a cell, e.g., a somatic cell, from the transgenic animal can be isolated and induced to exit 
the growth cycle and enter G 0 phase. The quiescent cell can then be fused, e.g., through 
the use of electrical pulses, to an enucleated oocyte from an animal of the same species 
from which the quiescent cell is isolated. The recontructed oocyte is then cultured such 
that it develops to morula or blastocyte and then transferred to pseudopregnant female 
foster animal. The offspring borne of this female foster animal will be a clone of the 
animal from which the cell, e.g., the somatic cell, is isolated. 

IV. Pharmaceutical Compositions 

The hVR-1, hVR-2, and rVR-2 nucleic acid molecules, fragments of hVR-1, 
hVR-2, and rVR-2 proteins, and anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibodies 
(also referred to herein as "active compounds") of the invention can be incorporated into 
pharmaceutical compositions suitable for administration. Such compositions typically 
comprise the nucleic acid molecule, protein, or antibody and a pharmaceutically 
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acceptable carrier. As used herein the language "pharmaceutical^ acceptable carrier" is 
intended to include any and all solvents, dispersion media, coatings, antibacterial and 
antifungal agents, isotonic and absorption delaying agents, and the like, compatible with 
pharmaceutical administration. The use of such media and agents for pharmaceutically 
5 active substances is well known in the art. Except insofar as any conventional media or 
agent is incompatible with the active compound, use thereof in the compositions is 
contemplated. Supplementary active compounds can also be incorporated into the 
compositions. 

A pharmaceutical composition of the invention is formulated to be compatible 
1 0 with its intended route of administration. Examples of routes of administration include 
parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), 
transdermal (topical), transmucosal, and rectal administration. Solutions or suspensions 
used for parenteral, intradermal, or subcutaneous application can include the following 
components: a sterile diluent such as water for injection, saline solution, fixed oils, 
15 polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 

antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic 
acid; buffers such as acetates, citrates or phosphates and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases, 
20 such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be 
enclosed in ampoules, disposable syringes or multiple dose vials made of glass or 
plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersion. For 
intravenous administration, suitable carriers include physiological saline, bacteriostatic 
water, Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered saline (PBS). In 
all cases, the composition must be sterile and should be fluid to the extent that easy 
syringability exists. It must be stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action of microorganisms such as 
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for 
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lactose, a disintegrating agent such as alginic acid, Primogel, or com starch; a lubricant 
such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a 
sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, 
methyl salicylate, or orange flavoring. 
5 For administration by inhalation, the compounds are delivered in the form of an 

aerosol spray from pressured container or dispenser which contains a suitable propellant, 
e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be 

10 permeated are used in the formulation. Such penetrants are generally known in the art, 
and include, for example, for transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be accomplished through the 
use of nasal sprays or suppositories. For transdermal administration, the active 
compounds are formulated into ointments, salves, gels, or creams as generally known in 

15 the art. 

The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will 

20 protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 
Methods for preparation of such formulations will be apparent to those skilled in the art. 

25 The materials can also be obtained commercially from Alza Corporation and Nova 

Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected 
cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutical^ 
acceptable carriers. These can be prepared according to methods known to those skilled 
in the art, for example, as described in U.S. Patent No. 4,522,81 1. 



W ° 00/29577 PCT/US99/2670. 

-56- 

It is especially advantageous to formulate oral or parenteral compositions in 
dosage unit form for ease of administration and uniformity of dosage. Dosage unit form 
as used herein refers to physically discrete units suited as unitary dosages for the subject 
to be treated; each unit containing a predetermined quantity of active compound 
5 calculated to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
dictated by and directly dependent on the unique characteristics of the active compound 
and the particular therapeutic effect to be achieved, and the limitations inherent in the art 
of compounding such an active compound for the treatment of individuals. 
10 Toxicity and therapeutic efficacy of such compounds can be determined by 

standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for 
determining the LD50 (the dose lethal to 50% of the population) and the ED50 (the dose 
therapeutically effective in 50% of the population). The dose ratio between toxic and 
therapeutic effects is the therapeutic index and it can be expressed as the ratio 
15 LD50/ED50. Compounds which exhibit large therapeutic indices are preferred. While 
compounds that exhibit toxic side effects may be used, care should be taken to design a 
delivery system that targets such compounds to the site of affected tissue in order to 
minimize potential damage to uninfected cells and, thereby, reduce side effects. 

The data obtained from the cell culture assays and animal studies can be used in 
20 formulating a range of dosage for use in humans. The dosage of such compounds lies 
preferably within a range of circulating concentrations that include the ED50 with little 
or no toxicity. The dosage may vary within this range depending upon the dosage form 
employed and the route of administration utilized. For any compound used in the 
method of the invention, the therapeutically effective dose can be estimated initially 
25 from cell culture assays. A dose may be formulated in animal models to achieve a 
circulating plasma concentration range that includes the IC50 (/.*., the concentration of 
the test compound which achieves a half-maximal inhibition of symptoms) as 
determined in cell culture. Such information can be used to more accurately determine 
useful doses in humans. Levels in plasma may be measured, for example, by high 
30 performance liquid chromatography. 
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As defined herein, a therapeutically effective amount of protein or polypeptide 
(i.e., an effective dosage) ranges from about 0.001 to 30 mg/kg body weight, preferably 
about 0.01 to 25 mg/kg body weight, more preferably about 0.1 to 20 mg/kg body 
weight, and even more preferably about 1 to 10 mg/kg, 2 to 9 mg/kg, 3 to 8 mg/kg, 4 to 
5 7 mg/kg, or 5 to 6 mg/kg body weight. The skilled artisan will appreciate that certain 
factors may influence the dosage required to effectively treat a subject, including but not 
limited to the severity of the disease or disorder, previous treatments, the general health 
and/or age of the subject, and other diseases present. Moreover, treatment of a subject 
with a therapeutically effective amount of a protein, polypeptide, or antibody can 

10 include a single treatment or, preferably, can include a series of treatments. In a 

preferred example, a subject is treated with antibody, protein, or polypeptide in the range 
of between about 0.1 to 20 mg/kg body weight, one time per week for between about 1 
to 10 weeks, preferably between 2 to 8 weeks, more preferably between about 3 to 7 
weeks, and even more preferably for about 4, 5, or 6 weeks. It will also be appreciated 

1 5 that the effective dosage of antibody, protein, or polypeptide used for treatment may 
increase or decrease over the course of a particular treatment. Changes in dosage may 
result and become apparent from the results of diagnostic assays as described herein. 

The present invention encompasses agents which modulate expression or 
activity. An agent may, for example, be a small molecule. For example, such small 

20 molecules include, but are not limited to, peptides, peptidomimetics, amino acids, amino 
acid analogs, polynucleotides, polynucleotide analogs, nucleotides, nucleotide analogs, 
organic or inorganic compounds (i.e,. including heteroorganic and organometallic 
compounds) having a molecular weight less than about 10,000 grams per mole, organic 
or inorganic compounds having a molecular weight less than about 5,000 grams per 

25 mole, organic or inorganic compounds having a molecular weight less than about 1,000 
grams per mole, organic or inorganic compounds having a molecular weight less than 
about 500 grams per mole, and salts, esters, and other pharmaceutical^ acceptable forms 
of such compounds. 

It is understood that appropriate doses of small molecule agents depends upon a 
30 number of factors within the ken of the ordinarily skilled physician, veterinarian, or 
researcher. The dose(s) of the small molecule will vary, for example, depending upon 
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the identity, size, and condition of the subject or sample being treated, further depending 
upon the route by which the composition is to be administered, if applicable, and the 
effect which the practitioner desires the small molecule to have upon the nucleic acid or 
polypeptide of the invention. 
5 Exemplary doses include milligram or microgram amounts of the small molecule 

per kilogram of subject or sample weight (e.g., about 1 microgram per kilogram to about 
500 milligrams per kilogram, about 100 micrograms per kilogram to about 5 milligrams 
per kilogram, or about 1 microgram per kilogram to about 50 micrograms per kilogram. 
It is furthermore understood that appropriate doses of a small molecule depend upon the 

10 potency of the small molecule with respect to the expression or activity to be modulated. 
Such appropriate doses may be determined using the assays described herein. 

When one or more of these small molecules is to be administered to an animal 
(e.g., a human) in order to modulate expression or activity of a polypeptide or nucleic 
acid of the invention, a physician, veterinarian, or researcher may, for example, 

15 prescribe a relatively low dose at first, subsequently increasing the dose until an 

appropriate response is obtained. In addition, it is understood that the specific dose level 
for any particular animal subject will depend upon a variety of factors including the 
activity of the specific compound employed, the age, body weight, general health, 
gender, and diet of the subject, the time of administration, the route of administration, 

20 the rate of excretion, any drug combination, and the degree of expression or activity to 
be modulated. 

The nucleic acid molecules of the invention can be inserted into vectors and used 
as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for 
example, intravenous injection, local administration (see U.S. Patent 5,328,470) or by 

25 stereotactic injection (see e.g., Chen et al (1994) Proc. Natl. Acad ScL USA 91 :3054- 
3057). The pharmaceutical preparation of the gene therapy vector can include the gene 
therapy vector in an acceptable diluent, or can comprise a slow release matrix in which 
the gene delivery vehicle is imbedded. Alternatively, where the complete gene delivery 
vector can be produced intact from recombinant cells, e.g. , retroviral vectors, the 

30 pharmaceutical preparation can include one or more cells which produce the gene 
delivery system. 
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The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

V. Uses and Methods of the Invention 
5 The nucleic acid molecules, proteins, protein homologues, and antibodies 

described herein can be used in one or more of the following methods: a) screening 
assays; b) predictive medicine (e.g., diagnostic assays, prognostic assays, monitoring 
clinical trials, and pharmacogenetics); and c) methods of treatment (e.g., therapeutic and 
prophylactic). As described herein, an hVR-1, hVR-2, and rVR-2 protein of the 

1 0 invention has one or more of the following activities: ( 1 ) it interacts with a non-hVR- 1 , 
non-hVR-2, and non-rVR-2 protein molecule, e.g., a vanilloid compound such as 
capsaicin; (2) it modulates intracellular calcium concentration; (3) it activates an hVR- 
1, hVR-2, and rVR-2-dependent signal transduction pathway; and (4) it modulates a pain 
signaling mechanism, and, thus, can be used to, for example, (1) modulate the 

15 interaction with a non-hVR-1, non-hVR-2, and non-rVR-2 protein molecule; (2) 

modulate intracellular calcium concentration; (3) activate an hVR-1, hVR-2, and rVR- 
2-dependent signal transduction pathway; and (4) modulate a pain signaling mechanism. 

The isolated nucleic acid molecules of the invention can be used, for example, to 
express hVR-1, hVR-2, and rVR-2 protein (e.g., via a recombinant expression vector in 

20 a host cell in gene therapy applications), to detect hVR- 1 , hVR-2, and rVR-2 mRNA 
(e.g., in a biological sample) or a genetic alteration in an hVR-1 , hVR-2, and rVR-2 
gene, and to modulate hVR-1, hVR-2, and rVR-2 activity, as described further below. 
The hVR-1, hVR-2, and rVR-2 proteins can be used to screen for naturally occurring 
hVR-1, hVR-2, and rVR-2 substrates, to screen for drugs or compounds which modulate 

25 hVR-1, hVR-2, and rVR-2 activity, as well as to treat disorders characterized by 

insufficient or excessive production of hVR-1, hVR-2, and rVR-2 protein or production 
of hVR-1, hVR-2, and rVR-2 protein forms which have decreased or aberrant activity 
compared to hVR-1, hVR-2, and rVR-2 wild type protein (e.g., pain disorders). 
Moreover, the anti-hVR-1 , anti-hVR-2, and anti-rVR-2 antibodies of the invention can 

30 be used to detect and isolate hVR-1, hVR-2, and rVR-2 proteins, regulate the 
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bioavailability of hVR-1, hVR-2, and rVR-2 proteins, arid modulate hVR-1, hVR-2, and 
rVR-2 activity. 

A. Screening Assays : 
5 The invention provides a method (also referred to herein as a "screening assay") 

for identifying modulators, /.e., candidate or test compounds or agents (e.g., peptides, 
peptidomimetics, small molecules or other drugs) which bind to hVR-1, hVR-2, and 
rVR-2 proteins, have a stimulatory or inhibitory effect on, for example, hVR-1, hVR-2, 
and rVR-2 expression or hVR-1, hVR-2, and rVR-2 activity, or have a stimulatory or 
1 0 inhibitory effect on, for example, the expression or activity of hVR- 1 , h VR-2, and rVR- 
2 substrate. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which are substrates of an hVR-1, hVR-2, and rVR-2 protein or polypeptide 
or biologically active portion thereof. In another embodiment, the invention provides 

1 5 assays for screening candidate or test compounds which bind to or modulate the activity 
of an hVR-1, hVR-2, and rVR-2 protein or polypeptide or biologically active portion 
thereof. The test compounds of the present invention can be obtained using any of the 
numerous approaches in combinatorial library methods known in the art, including: 
biological libraries; spatially addressable parallel solid phase or solution phase libraries; 

20 synthetic library methods requiring deconvolution; the 'one-bead one-compound' library 
method; and synthetic library methods using affinity chromatography selection. The 
biological library approach is limited to peptide libraries, while the other four 
approaches are applicable to peptide, non-peptide oligomer or small molecule libraries 
of compounds (Lam, K.S. (1997) Anticancer Drug Des. 12:145). 

25 Examples of methods for the synthesis of molecular libraries can be found in the 

art, for example in: DeWitt et al (1993) Proc. Natl Acad Set U.S.A. 90:6909; Erb et 
al (1994) Proc. Natl Acad. Set USA 91:I1422;Zuckennanne/a/. (1994).J. Med 
Chem. 37:2678; Cho et al (1993) Science 261 :1303; Cairell et al (1994) Angew. Chem. 
Int. Ed Engl 33:2059; Carell et al (1994) Angew. Chem Int. Ed. Engl 33:2061; and in 

30 Gallop et al (1994) J. Med Chem. 37:1233. 
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Libraries of compounds may be presented in solution (e.g., Houghten (1992) 
Biotechniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor 
(1993) Nature 364:555-556), bacteria (Ladner USP 5,223,409), spores (Ladner USP 
'409), plasmids (Cull et al. (1992) Proc Natl Acad Sci USA 89:1 865-1869) or on phage 
(Scott and Smith (1990) Science 249:386-390); (Devlin (1990) Science 249:404-406); 
(Cwirla et al (1990) Proc, Natl Acad Sci. 87:6378-6382); (Felici (1991) J. Mol Biol 
222:301-3 1 0); (Ladner supra. ). 

In one embodiment, an assay is a cell-based assay in which a cell, e.g., a 
neuronal cell, which expresses an hVR-1, hVR-2, and rVR-2 protein or biologically 
active portion thereof is contacted with a test compound and the ability of the test 
compound to modulate hVR-1, hVR-2, and rVR-2 activity is determined. Determining 
the ability of the test compound to modulate hVR-1, hVR-2, and rVR-2 activity can be 
accomplished by monitoring, for example, intracellular calcium concentration or 
membrane depolarization by, e.g., patch-clamp recordings in whole-cell, inside-out, and 
outside-out configurations (as described in, for example, Tominaga M. et al (1998) 
Neuron 21 :53 1-543). Determining the ability of the test compound to modulate hVR-1, 
hVR-2, and rVR-2 activity can further be accomplished by monitoring the activity of an 
hVR-1, hVR-2, and rVR-2-regulated transcription factor. The cell, for example, can be 
of mammalian origin, e.g., a neuronal cell. 

The ability of the test compound to modulate hVR-1, hVR-2, and rVR-2 binding 
to a substrate or to bind to hVR-1, hVR-2, and rVR-2 can also be determined. 
Determining the ability of the test compound to modulate hVR-1, hVR-2, and rVR-2 
binding to a substrate can be accomplished, for example, by coupling the hVR-1, hVR- 
2, and rVR-2 substrate with a radioisotope or enzymatic label such that binding of the 
hVR-1, hVR-2, and rVR-2 substrate to hVR-1, hVR-2, and rVR-2 can be determined by 
detecting the labeled hVR-1 , hVR-2, and rVR-2 substrate in a complex. Determining 
the ability of the test compound to bind hVR-1, hVR-2, and rVR-2 can be accomplished, 
for example, by coupling the compound with a radioisotope or enzymatic label such that 
binding of the compound to hVR-1 , hVR-2, and rVR-2 can be determined by detecting 
the labeled hVR-1 , hVR-2, and rVR-2 compound in a complex. For example, 
compounds (e.g., hVR-1, hVR-2, and rVR-2 substrates) can be labeled with 125 I, 35 S, 
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14 C, or 3 H, either directly or indirectly, and the radioisotope detected by direct counting 
of radioemmission or by scintillation counting. Alternatively, compounds can be 
enzymatically labeled with, for example, horseradish peroxidase, alkaline phosphatase, 
or luciferase, and the enzymatic label detected by determination of conversion of an 
5 appropriate substrate to product. 

It is also within the scope of this invention to determine the ability of a 
compound (e.g., an hVR-I, hVR-2, and rVR-2 substrate) to interact with hVR-1, hVR-2, 
and rVR-2 without the labeling of any of the interactants. For example, a 
microphysiometer can be used to detect the interaction of a compound with hVR-1, 
10 hVR-2, and rVR-2 without the labeling of either the compound or the hVR-1, hVR-2, 
andrVR-2. McConnell, H. M. et al. (1992) Science 257:1906-1912. As used herein, a 
"microphysiometer" (e.g., Cytosensor) is an analytical instrument that measures the rate 
at which a cell acidifies its environment using a light-addressable potentiometric sensor 
(LAPS). Changes in this acidification rate can be used as an indicator of the interaction 
1 5 between a compound and h VR- 1 , hVR-2, and rVR-2. 

In yet another embodiment, an assay of the present invention is a cell-free assay 
in which an hVR-1, hVR-2, and rVR-2 protein or biologically active portion thereof is 
contacted with a test compound and the ability of the test compound to bind to the hVR- 
1, hVR-2, and rVR-2 protein or biologically active portion thereof is determined. 
20 biologically active portions of the hVR-1, hVR-2, and rVR-2 proteins to be used in 
assays of the present invention include fragments which participate in interactions with 
non-hVR-1, non-hVR-2, and non-rVR-2 molecules, e.g., fragments with high surface 
probability scores. Binding of the test compound to the hVR-1, hVR-2, and rVR-2 
protein can be determined either directly or indirectly as described above. In a 
embodiment, the assay includes contacting the hVR-1, hVR-2, and rVR-2 protein or 
biologically active portion thereof with a known compound which binds hVR- 1 , hVR-2, 
and rVR-2 to form an assay mixture, contacting the assay mixture with a test compound, 
and determining the ability of the test compound to interact with an hVR-1 , hVR-2, and 
rVR-2 protein, wherein determining the ability of the test compound to interact with an 
hVR-1, hVR-2, and rVR-2 protein comprises detenruning the ability of the test 
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compound to preferentially bind to hVR-1. hVR-2, and rVR-2 or biologically active 
portion thereof as compared to the known compound. 

In another embodiment, the assay is a cell-free assay in which an hVR-1, hVR-2, 
and rVR-2 protein or biologically active portion thereof is contacted with a test 
compound and the ability of the test compound to modulate (e.g., stimulate or inhibit) 
the activity of the hVR-1, hVR-2, and rVR-2 protein or biologically active portion 
thereof is determined. Determining the ability of the test compound to modulate the 
activity of an hVR-1, hVR-2, and rVR-2 protein can be accomplished, for example, by 
determining the ability of the hVR-1, hVR-2, and rVR-2 protein to bind to an hVR-1, 
hVR-2, and rVR-2 target molecule, e.g., a vanilloid compound such as capsaicin, by one 
of the methods described above for determining direct binding. Determining the ability 
of the hVR-1, hVR-2, and rVR-2 protein to bind to an hVR-1, hVR-2, and rVR-2 target 
molecule can also be accomplished using a technology such as real-time Biomolecular 
Interaction Analysis (BIA). Sjolander, S. and Urbaniczky, C. (1991) Ami Chem. 
63:2338-2345 and Szabo et al. (1995) Curr. Opin. Struct. Biol. 5:699-705. As used 
herein, "BIA" is a technology for studying biospecific interactions in real time, without 
labeling any of the interacts (e.g., BlAcore). Changes in the optical phenomenon of 
surface plasmon resonance (SPR) can be used as an indication of real-time reactions 

between biological molecules. 

In an alternative embodiment, determining the ability of the test compound to 
modulate the activity of an hVR-1 , hVR-2, and rVR-2 protein can be accomplished by 
detemuning the ability of the hVR-1 , hVR-2, and rVR-2 protein to further modulate the 
activity of a downstream effector of an hVR-1, hVR-2, and rVR-2 target molecule. For 
example, the activity of the effector molecule on an appropriate target can be determined 
or the binding of the effector to an appropriate target can be determined as previously 
described. 

In yet another embodiment, the cell-free assay involves contacting an hVR-1, 
hVR-2, and rVR-2 protein or biologically active portion thereof with a known 
compound which binds the hVR-1, hVR-2, and rVR-2 protein to form an assay mixture, 
contacting the assay mixture with a test compound, and determining the ability of the 
test compound to interact with the hVR-1, hVR-2, and rVR-2 protein, wherein 
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determining the ability of the test compound to interact with the hVR-1 , hVR-2, and 
rVR-2 protein comprises determining the ability of the hVR-1, hVR-2, and rVR-2 
protein to preferentially bind to or modulate the activity of an hVR-1, hVR-2, and rVR-2 
target molecule. 

5 The cell-free assays of the present invention are amenable to use of both soluble 

and/or membrane-bound forms of isolated proteins (e.g. , hVR-1, hVR-2, and rVR-2 
proteins or biologically active portions thereof). In the case of cell-free assays in which 
a membrane-bound form of an isolated protein is used it may be desirable to utilize a 
solubilizing agent such that the membrane-bound form of the isolated protein is 
1 0 maintained in solution. Examples of such solubilizing agents include non-ionic 

detergents such as n-octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl- 
N-methylglucamide, decanoyl-N-methylglucamide, Triton® X-100, Triton® X-l 14, 
Thesit®, Isotridecypoly(ethylene glycol ether) n , 3-[(3- 
cholamidopropyl)dimethylamminio]-l -propane sulfonate (CHAPS), 3-[(3- 
1 5 cholarrudopropyl)dimethyIarnminio]-2-hydroxy-l-propane sulfonate (CHAPSO), or N- 
dodecy l=N,N-dimethyl-3-ammonio- 1 -propane sulfonate. 

In more than one embodiment of the above assay methods of the present 
invention, it may be desirable to immobilize either hVR-1, hVR-2, and rVR-2 or its 
target molecule to facilitate separation of complexed from uncomplexed forms of one or 
20 both of the proteins, as well as to accommodate automation of the assay. Binding of a 
test compound to an hVR-1, hVR-2, and rVR-2 protein, or interaction of an hVR-1, 
hVR-2, and rVR-2 protein with a target molecule in the presence and absence of a 
candidate compound, can be accomplished in any vessel suitable for containing the 
reactants. Examples of such vessels include microtitre plates, test tubes, and micro- 
25 centrifuge tubes. In one embodiment, a fusion protein can be provided which adds a 
domain that allows one or both of the proteins to be bound to a matrix. For example, 
glutathione-S-transferase/hVR-1, hVR-2, and rVR-2 fusion proteins or glutathione-S- 
transferase/target fusion proteins can be adsorbed onto glutathione sepharose beads 
(Sigma Chemical, St. Louis, MO) or glutathione derivatized microtitre plates, which are 
30 then combined with the test compound or the test compound and either the non-adsorbed 
target protein or hVR-1 , hVR-2, and rVR-2 protein, and the mixture incubated under 
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conditions conducive to complex formation (e.g., at physiological conditions for salt and 
pH). Following incubation, the beads or microtitre plate wells are washed to remove 
any unbound components, the matrix immobilized in the case of beads, complex 
determined either directly or indirectly, for example, as described above. Alternatively, 
5 the complexes can be dissociated from the matrix, and the level of hVR-1, hVR-2, and 
rVR-2 binding or activity determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either an hVR-1, hVR-2, and rVR-2 
protein or an hVR-1, hVR-2, and rVR-2 target molecule can be immobilized utilizing 
10 conjugation of biotin and streptavidin. Biotinylated hVR-1 , hVR-2, and rVR-2 protein 
or target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using 
techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL), and 
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). 
Alternatively, antibodies reactive with hVR-1, hVR-2, and rVR-2 protein or target 
15 molecules but which do not interfere with binding of the hVR-1, hVR-2, and rVR-2 
protein to its target molecule can be derivatized to the wells of the plate, and unbound 
target or hVR-1 , hVR-2, and rVR-2 protein trapped in the wells by antibody 
conjugation. Methods for detecting such complexes, in addition to those described 
above for the GST-immobilized complexes, include immunodetection of complexes 
20 using antibodies reactive with the hVR-1, hVR-2, and rVR-2 protein or target molecule, 
as well as enzyme-linked assays which rely on detecting an enzymatic activity 
associated with the hVR-1, hVR-2, and rVR-2 protein or target molecule. 

In another embodiment, modulators of hVR-1, hVR-2, and rVR-2 expression are 
identified in a method wherein a cell is contacted with a candidate compound and the 
25 expression of hVR-1, hVR-2, and rVR-2 mRNA or protein in the cell is determined. 
The level of expression of hVR-1 , hVR-2, and rVR-2 mRNA or protein in the presence 
of the candidate compound is compared to the level of expression of hVR-1, hVR-2, and 
rVR-2 mRNA or protein in the absence of the candidate compound. The candidate 
compound can then be identified as a modulator of hVR-1 , hVR-2, and rVR-2 
30 expression based on this comparison. For example, when expression of hVR-1, hVR-2, 
and rVR-2 mRNA or protein is greater (statistically significantly greater) in the presence 
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of the candidate compound than in its absence, the candidate compound is identified as a 
stimulator of hVR-1, hVR-2, and rVR-2 mRNA or protein expression. Alternatively, 
when expression of hVR-1, hVR-2, and rVR-2 mRNA or protein is less (statistically 
significantly less) in the presence of the candidate compound than in its absence, the 
5 candidate compound is identified as an inhibitor of hVR-1 , hVR-2, and rVR-2 mRNA or 
protein expression. The level of hVR-1, hVR-2, and rVR-2 mRNA or protein 
expression in the cells can be determined by methods described herein for detecting 
hVR-1, hVR-2, and rVR-2 mRNA or protein. 

In yet another aspect of the invention, the hVR-1, hVR-2, and rVR-2 proteins 

10 can be used as "bait proteins" in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. 
Patent No. 5,283,317; Zervos et al (1993) Cell 72:223-232; Madura et al (1993) J. 
Biol Chem. 268:12046-12054; BarteU/ al (1993) Biotechniques 14:920-924; Iwabuchi 
et al (1993) Oncogene 8:1693-1696; and Brent WO94/I0300), to identify other 
proteins, which bind to or interact with hVR-1, hVR-2, and rVR-2 ("hVR-1 -binding 

1 5 proteins", M hVR-2-binding proteins' 1 , and "rVR-2-binding proteins" or "hVR-1 -bp", 
"hVR-2-bp", and "rVR-2-bp") and are involved in hVR-1, hVR-2, and rVR-2 activity. 
Such hVR-1, hVR-2, and rVR-2-binding proteins are also likely to be involved in the 
propagation of signals by the hVR-1, hVR-2, and rVR-2 proteins or hVR-1, hVR-2, and 
rVR-2 targets as, for example, downstream elements of an hVR-1, hVR-2, and rVR-2- 

20 mediated signaling pathway, e.g. , a pain signaling pathway. Alternatively, such hVR- 1 , 
hVR-2, and rVR-2-binding proteins are likely to be hVR-1, hVR-2, and rVR-2 
inhibitors. 

The two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. Briefly, the 

25 assay utilizes two different DNA constructs. In one construct, the gene that codes for an 
hVR- 1 , hVR-2, and rVR-2 protein is fused to a gene encoding the DNA binding domain 
of a known transcription factor (e.g., GAL-4). In the other construct, a DNA sequence, 
from a library of DNA sequences, that encodes an unidentified protein ("prey" or 
"sample") is fused to a gene that codes for the activation domain of the known 

30 transcription factor. If the "bait" and the "prey" proteins are able to interact, in vivo, 
forming an hVR-1, hVR-2, and rVR-2-dependent complex, the DNA-binding and 
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activation domains of the transcription factor are brought into close proximity. This 
proximity allows transcription of a reporter gene (e.g., LacZ) which is operably linked to 
a transcriptional regulatory site responsive to the transcription factor. Expression of the 
reporter gene can be detected and cell colonies containing the functional transcription 
5 factor can be isolated and used to obtain the cloned gene which encodes the protein 
which interacts with the hVR- 1 , hVR-2, and rVR-2 protein. 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an 
agent identified as described herein in an appropriate animal model. For example, an 
10 agent identified as described herein (e.g., an hVR-1, hVR-2, and rVR-2 modulating 

agent, an antisense hVR-1, hVR-2, and rVR-2 nucleic acid molecule, an hVR-1, hVR-2, 
and rVR-2-specific antibody, or an hVR-1, hVR-2, and rVR-2-binding partner) can be 
used in an animal model to determine the efficacy, toxicity, or side effects of treatment 
with such an agent. Alternatively, an agent identified as described herein can be used in 
15 an animal model to determine the mechanism of action of such an agent. Furthermore, 
this invention pertains to uses of novel agents identified by the above-described 
screening assays for treatments as described herein. 

B. Detection Assays 

20 Portions or fragments of the cDNA sequences identified herein (and the 

corresponding complete gene sequences) can be used in numerous ways as 
polynucleotide reagents. For example, these sequences can be used to: (i) map their 
respective genes on a chromosome; and, thus, locate gene regions associated with 
genetic disease; (ii) identify an individual from a minute biological sample (tissue 

25 typing); and (iii) aid in forensic identification of a biological sample. These applications 
are described in the subsections below. 

1 . Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a gene has been isolated, this 
30 sequence can be used to map the location of the gene on a chromosome. This process is 
called chromosome mapping. Accordingly, portions or fragments of the hVR-1 , hVR-2, 
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and rVR-2 nucleotide sequences, described herein, can be used to map the location of 
the hVR-I , hVR-2, and rVR-2 genes on a chromosome. The mapping of the hVR-1 , 
hVR-2, and rVR-2 sequences to chromosomes is an important first step in correlating 
these sequences with genes associated with disease. 

5 Briefly, hVR-1 , hVR-2, and rVR-2 genes can be mapped to chromosomes by 

preparing PCR primers (preferably 15-25 bp in length) from the hVR-1, hVR-2, and 
rVR-2 nucleotide sequences. Computer analysis of the hVR-1, hVR-2, and rVR-2 
sequences can be used to predict primers that do not span more than one exon in the 
genomic DNA, thus complicating the amplification process. These primers can then be 

10 used for PCR screening of somatic cell hybrids containing individual human 

chromosomes. Only those hybrids containing the human gene corresponding to the 
hVR-h hVR-2, and rVR-2 sequences will yield an amplified fragment. 

Somatic cell hybrids are prepared by fusing somatic cells from different 
mammals (e.g., human and mouse cells). As hybrids of human and mouse cells grow 

15 and divide, they gradually lose human chromosomes in random order, but retain the 
mouse chromosomes. By using media in which mouse cells cannot grow, because they 
lack a particular enzyme, but human cells can, the one human chromosome that contains 
the gene encoding the needed enzyme, will be retained. By using various media, panels 
of hybrid cell lines can be established. Each cell line in a panel contains either a single 

20 human chromosome or a small number of human chromosomes, and a full set of mouse 
chromosomes, allowing easy mapping of individual genes to specific human 
chromosomes. (D'Eustachio P. et al (1983) Science 220:919-924). Somatic cell 
hybrids containing only fragments of human chromosomes can also be produced by 
using human chromosomes with translocations and deletions. 

25 PCR mapping of somatic cell hybrids is a rapid procedure for assigning a 

particular sequence to a particular chromosome. Three or more sequences can be 
assigned per day using a single thermal cycler. Using the hVR-1, hVR-2, and rVR-2 
nucleotide sequences to design oligonucleotide primers, sublocalization can be achieved 
with panels of fragments from specific chromosomes. Other mapping strategies which 

30 can similarly be used to map an hVR-1, hVR-2, and rVR-2 sequence to its chromosome 
include in situ hybridization (described in Fan, Y. et al (1990) Proc. Natl. Acad Set 
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USA, 87:6223-27), pre-screening with labeled flow-sorted chromosomes, and pre- 
selection by hybridization to chromosome specific cDNA libraries. 

Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase 
chromosomal spread can further be used to provide a precise chromosomal location in 
one step. Chromosome spreads can be made using cells whose division has been 
blocked in metaphase by a chemical such as colcemid that disrupts the mitotic spindle. 
The chromosomes can be treated briefly with trypsin, and then stained with Giemsa. A 
pattern of light and dark bands develops on each chromosome, so that the chromosomes 
can be identified individually. The FISH technique can be used with a DNA sequence 
as short as 500 or 600 bases. However, clones larger than 1,000 bases have a higher 
likelihood of binding to a unique chromosomal location with sufficient signal intensity 
for simple detection. Preferably 1,000 bases, and more preferably 2,000 bases will 
suffice to get good results at a reasonable amount of time. For a review of this 
technique, see Verma et al. , Human Chromosomes: A Manual of Basic Techniques 
(Pergamon Press, New York 1988). 

Reagents for chromosome mapping can be used individually to mark a single 
chromosome or a single site on that chromosome, or panels of reagents can be used for 
marking multiple sites and/or multiple chromosomes. Reagents corresponding to 
noncoding regions of the genes actually are for mapping purposes. Coding sequences 
are more likely to be conserved within gene families, thus increasing the chance of cross 
hybridizations during chromosomal mapping. 

Once a sequence has been mapped to a precise chromosomal location, the 
physical position of the sequence on the chromosome can be correlated with genetic map 
data. (Such data are found, for example, in V. McKusick, Mendelian Inheritance in 
Man, available on-line through Johns Hopkins University Welch Medical Library). The 
relationship between a gene and a disease, mapped to the same chromosomal region, can 
then be identified through linkage analysis (co-inheritance of physically adjacent genes), 
described in, for example, Egeland, J. et al. (1987) Nature, 325:783-787. 

Moreover, differences in the DNA sequences between individuals affected and 
unaffected with a disease associated with the hVR-1 , hVR-2, and rVR-2 gene, can be 
determined. If a mutation is observed in some or all of the affected individuals but not 
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in any unaffected individuals, then the mutation is likely to be the causative agent of the 
particular disease. Comparison of affected and unaffected individuals generally involves 
first looking for structural alterations in the chromosomes, such as deletions or 
translocations that are visible from chromosome spreads or detectable using PCR based 
5 on that DNA sequence. Ultimately, complete sequencing of genes from several 
individuals can be performed to confirm the presence of a mutation and to distinguish 
mutations from polymorphisms. 



2. Tissue Typing 

1 0 The hVR- 1 , hVR-2, and rVR-2 sequences of the present invention can also be 

used to identify individuals from minute biological samples. The United States military, 
for example, is considering the use of restriction fragment length polymorphism (RFLP) 
for identification of its personnel. In this technique, an individual's genomic DNA is 
digested with one or more restriction enzymes, and probed on a Southern blot to yield 
1 5 unique bands for identification. This method does not suffer from the current limitations 
of "Dog Tags" which can be lost, switched, or stolen, making positive identification 
difficult. The sequences of the present invention are useful as additional DNA markers 
for RFLP (described in U.S. Patent 5,272,057). 

Furthermore, the sequences of the present invention can be used to provide an 
20 alternative technique which determines the actual base-by-base DNA sequence of 
selected portions of an individual's genome. Thus, the hVR-1, hVR-2, and rVR-2 
nucleotide sequences described herein can be used to prepare two PCR primers from the 
5* and 3' ends of the sequences. These primers can then be used to amplify an 
individual's DNA and subsequently sequence it. 
25 Panels of corresponding DNA sequences from individuals, prepared in this 

manner, can provide unique individual identifications, as each individual will have a 
unique set of such DNA sequences due to allelic differences. The sequences of the 
present invention can be used to obtain such identification sequences from individuals 
and from tissue. The hVR-1, hVR-2, and rVR-2 nucleotide sequences of the invention 
30 uniquely represent portions of the human genome. Allelic variation occurs to some 

degree in the coding regions of these sequences, and to a greater degree in the noncoding 
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regions. It is estimated that allelic variation between individual humans occurs with a 
frequency of about once per each 500 bases. Each of the sequences described herein 
can, to some degree, be used as a standard against which DNA from an individual can be 
compared for identification purposes. Because greater numbers of polymorphisms occur 
in the noncoding regions, fewer sequences are necessary to differentiate individuals. 

If a panel of reagents from hVR-1, hVR-2, and rVR-2 nucleotide sequences 
described herein is used to generate a unique identification database for an individual, 
those same reagents can later be used to identify tissue from that individual. Using the 
unique identification database, positive identification of the individual, living or dead, 
can be made from extremely small tissue samples. 

3. Use of Partial hVR-1. hVR-2, and rVR-2 Sequ ences in Forensic Biology 
DNA-based identification techniques can also be used in forensic biology. 
Forensic biology is a scientific field employing genetic typing of biological evidence 
found at a crime scene as a means for positively identifying, for example, a perpetrator 
of a crime. To make such an identification, PCR technology can be used to amplify 
DNA sequences taken from very small biological samples such as tissues, e.g., hair or 
skin, or body fluids, e.g., blood, saliva, or semen found at a crime scene. The amplified 
sequence can then be compared to a standard, thereby allowing identification of the 
origin of the biological sample. 

The sequences of the present invention can be used to provide polynucleotide 
reagents, e.g., PCR primers, targeted to specific loci in the human genome, which can 
enhance the reliability of DNA-based forensic identifications by, for example, providing 
another "identification marker" (i.e. another DNA sequence that is unique to a particular 
individual). As mentioned above, actual base sequence information can be used for 
identification as an accurate alternative to patterns formed by restriction enzyme 
generated fragments. Examples of polynucleotide reagents include the hVR-1, hVR-2, 
and rVR-2 nucleotide sequences or portions thereof, e.g. , fragments derived from SEQ 
ID NO:l, 3, 4, 6, 7, 9, 10, or 1 1 having a length of at least 20 bases, preferably at least 
30 bases. 
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The hVR-1, hVR-2, and rVR-2 nucleotide sequences described herein can further 
be used to provide polynucleotide reagents, e.g., labeled or labelable probes which can 
be used in, for example, an in situ hybridization technique, to identify a specific tissue, 
e.g., brain tissue. This can be very useful in cases where a forensic pathologist is 
5 presented with a tissue of unknown origin. Panels of such h VR- 1 , h VR-2, and rVR-2 
probes can be used to identify tissue by species and/or by organ type. 

In a similar fashion, these reagents, e.g., hVR-1, hVR-2, and rVR-2 primers or 
probes can be used to screen tissue culture for contamination (/.e. screen for the presence 
of a mixture of different types of cells in a culture). 

10 

C. Predictive Medicine : 

The present invention also pertains to the field of predictive medicine in which 
diagnostic assays, prognostic assays, and monitoring clinical trials are used for 
prognostic (predictive) purposes to thereby treat an individual prophylactically. 

15 Accordingly, one aspect of the present invention relates to diagnostic assays for 

determining hVR-1, hVR-2, and rVR-2 protein and/or nucleic acid expression as well as 
hVR-1 , hVR-2, and rVR-2 activity, in the context of a biological sample (e.g., blood, 
serum, cells, tissue) to thereby determine whether an individual is afflicted with a 
disease or disorder, or is at risk of developing a disorder, associated with aberrant hVR- 

20 1 , hVR-2, and rVR-2 expression or activity. The invention also provides for prognostic 
(or predictive) assays for determining whether an individual is at risk of developing a 
disorder associated with hVR-1, hVR-2, and rVR-2 protein, nucleic acid expression or 
activity. For example, mutations in an hVR-l,hVR-2, and rVR-2 gene can be assayed 
in a biological sample. Such assays can be used for prognostic or predictive purpose to 

25 thereby phophylactically treat an individual prior to the onset of a disorder characterized 
by or associated with hVR-1, hVR-2, and rVR-2 protein, nucleic acid expression or 
activity. 

Another aspect of the invention pertains to monitoring the influence of agents 
(e.g., drugs, compounds) on the expression or activity of hVR-1, hVR-2, and rVR-2 in 
30 clinical trials. 

These and other agents are described in further detail in the following sections. 
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1. Diagnostic Assays 

An exemplary method for detecting the presence or absence of hVR-1, hVR-2, 
and rVR-2 protein or nucleic acid in a biological sample involves obtaining a biological 
sample from a test subject and contacting the biological sample with a compound or an 
5 agent capable of detecting h VR- 1 , hVR-2, and rVR-2 protein or nucleic acid (e.g. , 
mRNA, genomic DNA) that encodes hVR-1, hVR-2, and rVR-2 protein such that the 
presence of hVR-1, hVR-2, and rVR-2 protein or nucleic acid is detected in the 
biological sample. A agent for detecting hVR-1, hVR-2, and rVR-2 mRNA or genomic 
DNA is a labeled nucleic acid probe capable of hybridizing to hVR-1, hVR-2, and rVR- 

10 2 mRNA or genomic DNA. The nucleic acid probe can be, for example, a full-length 
hVR-1, hVR-2, and rVR-2 nucleic acid, such as the nucleic acid of SEQ ID NO:l, 3, 4, 
6, 7, 9, 10, or 12, or a portion thereof, such as an oligonucleotide of at least 15, 30, 50, 
100, 250 or 500 nucleotides in length and sufficient to specifically hybridize under 
stringent conditions to hVR-1, hVR-2, and rVR-2 mRNA or genomic DNA. Other 

15 suitable probes for use in the diagnostic assays of the invention are described herein. 

An agent for detecting hVR-1 , hVR-2, and rVR-2 protein is an antibody capable 
of binding to hVR-1, hVR-2, and rVR-2 protein, preferably an antibody with a 
detectable label. Antibodies can be polyclonal, or more preferably, monoclonal. An 
intact antibody, or a fragment thereof (e.g. , Fab or F(ab r >2) can be used. The term 

20 "labeled", with regard to the probe or antibody, is intended to encompass direct labeling 
of the probe or antibody by coupling (i.e., physically linking) a detectable substance to 
the probe or antibody, as well as indirect labeling of the probe or antibody by reactivity 
with another reagent that is directly labeled. Examples of indirect labeling include 
detection of a primary antibody using a fluorescently labeled secondary antibody and 

25 end-labeling of a DNA probe with biotin such that it can be detected with fluorescently 
labeled streptavidin. The term "biological sample" is intended to include tissues, cells 
and biological fluids isolated from a subject, as well as tissues, cells and fluids present 
within a subject. That is, the detection method of the invention can be used to detect 
hVR-1, hVR-2, and rVR-2 mRNA, protein, or genomic DNA in a biological sample in 

30 vitro as well as in vivo. For example, in vitro techniques for detection of hVR-1, hVR-2, 
and rVR-2 mRNA include Northern hybridizations and in situ hybridizations. In vitro 
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techniques for detection of hVR-1, hVR-2, and rVR-2 protein include enzyme linked 
immunosorbent assays (ELISAs), Western blots, immunoprecipitations and 
immunofluorescence. In vitro techniques for detection of h VR- 1 , hVR-2, and rVR-2 
genomic DNA include Southern hybridizations. Furthermore, in vivo techniques for 
5 detection of h VR- 1 , hVR-2, and rVR-2 protein include introducing into a subject a 
labeled anti-hVR-1 , hVR-2, and rVR-2 antibody. For example, the antibody can be 
labeled with a radioactive marker whose presence and location in a subject can be 
detected by standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules from the 
1 0 test subject. Alternatively, the biological sample can contain mRNA molecules from the 
test subject or genomic DNA molecules from the test subject. A biological sample is a 
serum sample isolated by conventional means from a subject. 

In another embodiment, the methods further involve obtaining a control 
biological sample from a control subject, contacting the control sample with a 
1 5 compound or agent capable of detecting hVR- 1 , hVR-2, and rVR-2 protein, mRNA, or 
genomic DNA, such that the presence of hVR-1, hVR-2, and rVR-2 protein, mRNA or 
genomic DNA is detected in the biological sample, and comparing the presence of hVR- 

1 , hVR-2, and rVR-2 protein, mRNA or genomic DNA in the control sample with the 
presence of hVR-1, hVR-2, and rVR-2 protein, mRNA or genomic DNA in the test 

20 sample. 

The invention also encompasses kits for detecting the presence of hVR-1, hVR- 

2, and rVR-2 in a biological sample. For example, the kit can comprise a labeled 
compound or agent capable of detecting hVR-1, hVR-2, and rVR-2 protein or mRNA in 
a biological sample; means for determining the amount of hVR-1, hVR-2, and rVR-2 in 

25 the sample; and means for comparing the amount of hVR-1 , hVR-2, and rVR-2 in the 
sample with a standard. The compound or agent can be packaged in a suitable container. 
The kit can further comprise instructions for using the kit to detect hVR-1, hVR-2, and 
rVR-2 protein or nucleic acid. 
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2. Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with aberrant 
h VR- 1 , hVR-2, and rVR-2 expression or activity. As used herein, the term "aberrant" 
5 includes an hVR-1 , hVR-2, and rVR-2 expression or activity which deviates from the 
wild type hVR-1, hVR-2, and rVR-2 expression or activity. Aberrant expression or 
activity includes increased or decreased expression or activity, as well as expression or 
activity which does not follow the wild type developmental pattern of expression or the 
subcellular pattern of expression. For example, aberrant hVR-1, hVR-2, and rVR-2 

1 0 expression or activity is intended to include the cases in which a mutation in the hVR- 1 , 
hVR-2, and rVR-2 gene causes the hVR-1, hVR-2, and rVR-2 gene to be under- 
expressed or over-expressed and situations in which such mutations result in a non- 
functional hVR-1, hVR-2, and rVR-2 protein or a protein which does not function in a 
wild-type fashion, e.g. , a protein which does not interact with an hVR-1 , hVR-2, and 

1 5 rVR-2 ligand or one which interacts with a non-hVR-1 , non-hVR-2, and non-rVR-2 
ligand. 

The assays described herein, such as the preceding diagnostic assays or the 
following assays, can be utilized to identify a subject having or at risk of developing a 
disorder associated with a misregulation in hVR-1, hVR-2, and rVR-2 protein activity or 

20 nucleic acid expression, such as a pain disorder. Alternatively, the prognostic assays can 
be utilized to identify a subject having or at risk for developing a disorder associated 
with a misregulation in hVR-1, hVR-2, and rVR-2 protein activity or nucleic acid 
expression, such as a pain disorder. Thus, the present invention provides a method for 
identifying a disease or disorder associated with aberrant hVR-1, hVR-2, and rVR-2 

25 expression or activity in which a test sample is obtained from a subject and hVR-1 , 
hVR-2, and rVR-2 protein or nucleic acid (e.g., mRNA or genomic DNA) is detected, 
wherein the presence of hVR-1, bVR-2, and rVR-2 protein or nucleic acid is diagnostic 
for a subject having or at risk of developing a disease or disorder associated with 
aberrant hVR-1 , hVR-2, and rVR-2 expression or activity. As used herein, a "test 

30 sample" refers to a biological sample obtained from a subject of interest. For example, a 
test sample can be a biological fluid (e.g., serum), cell sample, or tissue. 
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Furthermore, the prognostic assays described herein can be used to determine 
whether a subject can be administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate) 
to treat a disease or disorder associated with aberrant hVR-1, hVR-2, and rVR-2 
5 expression or activity. For example, such methods can be used to determine whether a 
subject can be effectively treated with an agent for a pain disorder. Thus, the present 
invention provides methods for determining whether a subject can be effectively treated 
with an agent for a disorder associated with aberrant hVR-1, hVR-2, and rVR-2 
expression or activity in which a test sample is obtained and hVR-1, hVR-2, and rVR-2 
10 protein or nucleic acid expression or activity is detected (e.g., wherein the abundance of 
hVR-1, hVR-2, and rVR-2 protein or nucleic acid expression or activity is diagnostic for 
a subject that can be administered the agent to treat a disorder associated with aberrant 
hVR-1. hVR-2, and rVR-2 expression or activity). 

The methods of the invention can also be used to detect genetic alterations in an 
15 hVR-1, hVR-2, and rVR-2 gene, thereby determining if a subject with the altered gene is 
at risk for a disorder characterized by misregulation in hVR- 1 , hVR-2, and rVR-2 
protein activity or nucleic acid expression, such as a neurodegenerative disorder. In 
embodiments, the methods include detecting, in a sample of cells from the subject, the 
presence or absence of a genetic alteration characterized by at least one of an alteration 
20 affecting the integrity of a gene encoding an hVR-1, hVR-2, and rVR-2-protein, or the 
mis-expression of the hVR-1, hVR-2, and rVR-2 gene. For example, such genetic 
alterations can be detected by ascertaining the existence of at least one of 1) a deletion of 
one or more nucleotides from an hVR-1, hVR-2, and rVR-2 gene; 2) an addition of one 
or more nucleotides to an hVR-1, hVR-2, and rVR-2 gene; 3) a substitution of one or 
25 more nucleotides of an hVR-1, hVR-2, and rVR-2 gene, 4) a chromosomal 

rearrangement of an hVR-1, hVR-2, and rVR-2 gene; 5) an alteration in the level of a 
messenger RNA transcript of an hVR-1, hVR-2, and rVR-2 gene, 6) aberrant 
modification of an hVR-I, hVR-2, and rVR-2 gene, such as of the mediation pattern of 
the genomic DNA, 7) the presence of a non-wild type splicing pattern of a messenger 
30 RNA transcript of an hVR-1, hVR-2, and rVR-2 gene, 8) a non-wild type level of an 
hVR-1, hVR-2, and rVR-2-protein, 9) allelic loss of an hVR-1, hVR-2, and rVR-2 gene, 
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and 10) inappropriate post-translational modification of an hVR-1, hVR-2, and rVR-2- 
protein. As described herein, there are a large number of assays known in the art which 
can be used for detecting alterations in an hVR-1, hVR-2, and rVR-2 gene. A biological 
sample is a tissue or serum sample isolated by conventional means from a subject. 
5 In certain embodiments, detection of the alteration involves the use of a 

probe/primer in a polymerase chain reaction (PCR) (see, e.g., U.S. Patent Nos. 
4,683,195 and 4,683,202), such as anchor PCR or RACE PCR, or, alternatively, in a 
ligation chain reaction (LCR) (see, e.g., Landegran et al (1988) Science 241 :1 077-1 080; 
and Nakazawa et al (1 994) Proc. Natl Acad. Sci. USA 91 :360-364), the latter of which 

10 can be particularly useful for detecting point mutations in the hVR-1, hVR-2, and rVR- 
2-gene (see Abravaya et al (1995) Nucleic Acids Res .23:675-682). This method can 
include the steps of collecting a sample of cells from a subject, isolating nucleic acid 
(e.g., genomic, mRNA or both) from the cells of the sample, contacting the nucleic acid 
sample with one or more primers which specifically hybridize to an hVR-1 , hVR-2, and 

1 5 rVR-2 gene under conditions such that hybridization and amplification of the hVR-1 , 
hVR-2, and rVR-2-gene (if present) occurs, and detecting the presence or absence of an 
amplification product, or detecting the size of the amplification product and comparing 
the length to a control sample. It is anticipated that PCR and/or LCR may be desirable 
to use as a preliminary amplification step in conjunction with any of the techniques used 

20 for detecting mutations described herein. 

Alternative amplification methods include: self sustained sequence replication 
(Guatelli, J.C. et al, (1990) Proc. Natl Acad Sci USA 87:1874-1878), transcriptional 
amplification system (Kwoh, D.Y. et al, (1989) Proc. Natl Acad Sci. USA 86:1 173- 
1 177), Q-Beta Replicase (Lizardi, P.M. et al. (1988) Bio-Technology 6: 1 197), or any 

25 other nucleic acid amplification method, followed by the detection of the amplified 
molecules using techniques well known to those of skill in the art. These detection 
schemes are especially useful for the detection of nucleic acid molecules if such 
molecules are present in very low numbers. 

In an alternative embodiment, mutations in an hVR-1, hVR-2, and rVR-2 gene 

30 from a sample cell can be identified by alterations in restriction enzyme cleavage 
patterns. For example, sample and control DNA is isolated, amplified (optionally), 
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digested with one or more restriction endonucleases, and fragment length sizes are 
determined by gel electrophoresis and compared. Differences in fragment length sizes 
between sample and control DNA indicates mutations in the sample DNA. Moreover, 
the use of sequence specific ribozymes (see, for example, U.S. Patent No. 5,498,531) ' 
5 can be used to score for the presence of specific mutations by development or loss of a 
ribozyme cleavage site. 

In other embodiments, genetic mutations in hVR-1, hVR-2, and rVR-2 can be 
identified by hybridizing a sample and control nucleic acids, e.g. , DNA or RNA, to high 
density arrays containing hundreds or thousands of oligonucleotides probes (Cronin 
10 U.l.etal. «m Human Mutation 7: 244-2S5;Kozal,MJ. etal. (1996) Nature 
Medicine 2: 753-759). For example, genetic mutations in hVR-1, hVR-2, and rVR-2 
can be identified in two dimensional arrays containing light-generated DNA probes as 
described in Cronin, M.T. etal. supra. Briefly, a first hybridization array of probes can 
be used to scan through long stretches of DNA in a sample and control to identify base 
15 changes between the sequences by making linear arrays of sequential overlapping 
probes. This step allows the identification of point mutations. This step is followed by 
a second hybridization array that allows the characterization of specific mutations by 
using smaller, specialized probe arrays complementary to all variants or mutations 
detected. Each mutation array is composed of parallel probe sets, one complementary to 
20 the wild-type gene and the other complementary to the mutant gene. 

In yet another embodiment, any of a variety of sequencing reactions known in 
the art can be used to directly sequence the hVR-I, hVR-2, and rVR-2 gene and detect 
mutations by comparing the sequence of the sample hVR-1, hVR-2, and rVR-2 with the 
corresponding wild-type (control) sequence. Examples of sequencing reactions include 
25 those based on techniques developed by Maxam and Gilbert ((1977) Proc. Natl. Acad. 
Sci. USA 74:560) or Sanger ((1977) Proc. Natl. Acad. Sci. USA 74:5463). It is also 
contemplated that any of a variety of automated sequencing procedures can be utilized 
when performing the diagnostic assays ((1995) Biotechniques 19:448), including 
sequencing by mass spectrometry (see, e.g., PCT International Publication No. WO 
30 94/16101; Cohen,/ al. (1996) Aav. Chromatogr. 36:127-162; and Griffin etal. (1993) 
Appl. Biochem. BiotechnoL 38:147-159). 
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Other methods for detecting mutations in the hVR-1, hVR-2, and rVR-2 gene 
include methods in which protection from cleavage agents is used to detect mismatched 
bases in RNA/RNA or RNA/DNA heteroduplexes (Myers et ai (1985) Science 
230:1242). In general, the art technique of "mismatch cleavage" starts by providing 
5 heteroduplexes of formed by hybridizing (labeled) RNA or DNA containing the wild- 
type hVR-1, hVR-2, and rVR-2 sequence with potentially mutant RNA or DNA 
obtained from a tissue sample. The double-stranded duplexes are treated with an agent 
which cleaves single-stranded regions of the duplex such as which will exist due to 
basepair mismatches between the control and sample strands. For instance, RNA/DNA 

1 0 duplexes can be treated with RNase and DNA/DNA hybrids treated with S 1 nuclease to 
enzymatically digesting the mismatched regions. In other embodiments, either 
DNA/DNA or RNA/DNA duplexes can be treated with hydroxylamine or osmium 
tetroxide and with piperidine in order to digest mismatched regions. After digestion of 
the mismatched regions, the resulting material is then separated by size on denaturing 

15 polyacrylamide gels to determine the site of mutation. See, for example, Cotton et ai 
(1988) Proc. Natl Acad Sci USA 85:4397; Saleeba et at. (1992) Methods Enzymoi 
217:286-295. In a embodiment, the control DNA or RNA can be labeled for detection. 

In still another embodiment, the mismatch cleavage reaction employs one or 
more proteins that recognize mismatched base pairs in double-stranded DNA (so called 

20 "DNA mismatch repair" enzymes) in defined systems for detecting and mapping point 
mutations in hVR-1, hVR-2, and rVR-2 cDNAs obtained from samples of cells. For 
example, the mutY enzyme of E. coli cleaves A at G/A mismatches and the thymidine 
DNA glycosylase from HeLa cells cleaves T at G/T mismatches (Hsu et al (1994) 
Carcinogenesis 15:1657-1662). According to an exemplary embodiment, a probe based 

25 on an hVR-1, hVR-2, and rVR-2 sequence, e.g., a wild-type hVR-1, hVR-2, and rVR-2 
sequence, is hybridized to a cDNA or other DNA product from a test cell(s). The duplex 
is treated with a DNA mismatch repair enzyme, and the cleavage products, if any, can be 
detected from electrophoresis protocols or the like. See, for example, U.S. Patent No. 
5,459,039. 
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In other embodiments, alterations in electrophoretic mobility will be used to 
.dentify mutations in hVR-1, hVR-2, and rVR-2 gene, For example, single stra „ d 
conformation polymorphism (SSCP) may be used to detect differs in electrophoretic 
momhty between mutant and wild type nucleic acids (orita et al. (1 989) Proc Natl. 
5 Acad. Sci USA: 86:2766, see also Cotton (1993) Mutat. Res. 285:125-144; and Hayashi 
(1992) Genet. Anal. Tech. Appl. 9:73-79). Single-stranded DNA fragments of sample 
and control hVR-1 , hVR-2, and rVR-2 nucleic acids will be denatured and allowed to 
renature. The secondary structure of single-stranded nucleic acids varies according to 
sequence, the resulting alteration in electrophoretic mobility enables the detection of 
10 even a single base change. The DNA fragments may be labeled or detected with labeled 
probes. The sensitivity of the assay may be enhanced by using RNA (rather than DNA) 
m which the secondary structure is more sensitive to a change in sequence In a 
embodiment, the subject method utilizes heteroduplex analysis to separate double 
stranded heteroduplex molecules on the basis of changes in electrophoretic mobility 
15 (Keen et al. (1991) Trends Genet 7:5). 

In yet another embodiment the movement of mutant or wild-type fragments in 
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing 
gradient gel electrophoresis (DGGE) (Myers et al. (1985) JWe 313:495) When 
DGGE is used as the method of analysis, DNA will be modified to insure that it does not 
20 completely denature, for example by adding a GC clamp of approximately 40 bp of 
htgh-melting GC-rich DNA by PGR. In a further embodiment, a temperature gradient is 
used m place of a denaturing gradient to identify differences in the mobility of control 
and sample DNA (Rosenbaum and Reissner (1987) Biophys Chem 265:12753). 

Examples of other techniques for detecting point mutations include, but are not 
limited to, selective oligonucleotide hybridization, selective amplification, or selective 
pnmer extension. For example, oligonucleotide primers may be prepared in which the 
known mutation is placed centrally and then hybridized to target DNA under conditions 
wmch permit hybridization only if a perfect match is found (Saiki */ al. ( 1 986) Nature 
324:163); Saiki et al. (1989) Proc. Natl Acad. Sci USA 86:6230). Such allele specific 
30 ohgonucleotides are hybridized to PGR amplified target DNA or a number of different 



25 
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mutations when the oligonucleotides are attached to the hybridizing membrane and 
hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology which depends on selective 
PCR amplification may be used in conjunction with the instant invention. 
Oligonucleotides used as primers for specific amplification may carry the mutation of 
interest in the center of the molecule (so that amplification depends on differential 
hybridization) (Gibbs et al (1989) Nucleic Acids Res. 17:2437-2448) or at the extreme 3' 
end of one primer where, under appropriate conditions, mismatch can prevent, or reduce 
polymerase extension (Prossner (1993) Tibtech 1 1 :238). In addition it may be desirable 
to introduce a novel restriction site in the region of the mutation to create cleavage-based 
detection (Gasparini et al (1992) Mol Cell Probes 6:1). It is anticipated that in certain 
embodiments amplification may also be performed using Taq ligase for amplification 
(Barany (1991) Proc. Natl Acad. Sci USA 88:189). In such cases, ligation will occur 
only if there is a perfect match at the 3' end of the 5' sequence making it possible to detect 
the presence of a known mutation at a specific site by looking for the presence or absence 
of amplification. 

The methods described herein may be performed, for example, by utilizing pre- 
packaged diagnostic kits comprising at least one probe nucleic acid or antibody reagent 
described herein, which may be conveniently used, e.g., in clinical settings to diagnose 
patients exhibiting symptoms or family history of a disease or illness involving an hVR- 
l,hVR-2, and rVR-2 gene. 

Furthermore, any cell type or tissue in which hVR-1, hVR-2, and rVR-2 is 
expressed may be utilized in the prognostic assays described herein. 

3. Monitoring of Effects During Clinical Trials 

Monitoring the influence of agents (e.g., drugs) on the expression or activity of 
an hVR-1, hVR-2, and rVR-2 protein can be applied not only in basic drug screening, 
but also in clinical trials. For example, the effectiveness of an agent determined by a 
screening assay as described herein to increase hVR-1 , hVR-2, and rVR-2 gene 
expression, protein levels, or upregulate hVR-1, hVR-2, and rVR-2 activity, can be 
monitored in clinical trials of subjects exhibiting decreased hVR-1, hVR-2, and rVR-2 
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gene expression, protein levels, or downregulated hVR-1, hVR-2, and rVR-2 activity. 
Alternatively, the effectiveness of an agent determined by a screening assay to decrease 
hVR-1, hVR-2, and rVR-2 gene expression, protein levels, or downregulate hVR-1, 
hVR-2, and rVR-2 activity, can be monitored in clinical trials of subjects exhibiting 
increased hVR-1, hVR-2, and rVR-2 gene expression, protein levels, or upregulated 
hVR-1 , hVR-2, and rVR-2 activity. In such clinical trials, the expression or activity of 
an hVR-1, hVR-2, and rVR-2 gene, and preferably, other genes that have been 
implicated in, for example, an hVR-1, hVR-2, and rVR-2-associated disorder can be 
used as a "read out" or markers of the phenotype of a particular cell. 

For example, and not by way of limitation, genes, including hVR-1, hVR-2, and 
rVR-2, that are modulated in cells by treatment with an agent (e.g., compound, drug or 
small molecule) which modulates hVR-1, hVR-2, and rVR-2 activity (e.g., identified in 
a screening assay as described herein) can be identified. Thus, to study the effect of 
agents on hVR-1, hVR-2, and rVR-2-associated disorders (e.g., pain disorders), for 
example, in a clinical trial, cells can be isolated and RNA prepared and analyzed for the 
levels of expression of hVR-1, hVR-2, and rVR-2 and other genes implicated in the 
hVR-1, hVR-2, and rVR-2-associated disorder, respectively. The levels of gene 
expression (e.g., a gene expression pattern) can be quantified by northern blot analysis 
or RT-PCR, as described herein, or alternatively by measuring the amount of protein 
produced, by one of the methods as described herein, or by measuring the levels of 
activity of hVR-l, hVR-2, and rVR-2 or other genes. In this way, the gene expression 
pattern can serve as a marker, indicative of the physiological response of the cells to the 
agent. Accordingly, this response state may be determined before, and at various points 
during treatment of the individual with the agent. 

In a embodiment, the present invention provides a method for monitoring the 
effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate 
identified by the screening assays described herein) including the steps of (i) obtaining a 
pre-administratton sample from a subject prior to administration of the agent; (ii) 
detecting the level of expression of an hVR-1, hVR-2, and rVR-2 protein, mRNA, or 
genomic DNA in the preadministration sample; (iii) obtaining one or more post- 
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administration samples from the subject; (iv) detecting the level of expression or activity 
of the hVR-1 , hVR-2, and rVR-2 protein, mRNA, or genomic DNA in the post- 
administration samples; (v) comparing the level of expression or activity of the hVR-1 , 
hVR-2, and rVR-2 protein, mRNA, or genomic DNA in the pre-administration sample 
5 with the hVR-1 , hVR-2, and rVR-2 protein, mRNA, or genomic DNA in the post 

administration sample or samples; and (vi) altering the administration of the agent to the 
subject accordingly. For example, increased administration of the agent may be 
desirable to increase the expression or activity of hVR-1, hVR-2, and rVR-2 to higher 
levels than detected, i.e., to increase the effectiveness of the agent. Alternatively, 
1 0 decreased administration of the agent may be desirable to decrease expression or activity 
of hVR-1, hVR-2, and rVR-2 to lower levels than detected, Le. to decrease the 
effectiveness of the agent. According to such an embodiment, hVR-1, hVR-2, and rVR- 
2 expression or activity may be used as an indicator of the effectiveness of an agent, 
even in the absence of an observable phenotypic response. 

15 

D. Methods of Treatment : 

The present invention provides for both prophylactic and therapeutic methods of 
treating a subject at risk of (or susceptible to) a disorder or having a disorder associated 
with aberrant hVR-1, hVR-2, and rVR-2 expression or activity. With regards to both 

20 prophylactic and therapeutic methods of treatment, such treatments may be specifically 
tailored or modified, based on knowledge obtained from the field of pharmacogenomics. 
"Pharmacogenomics", as used herein, refers to the application of genomics technologies 
such as gene sequencing, statistical genetics, and gene expression analysis to drugs in 
clinical development and on the market. More specifically, the term refers the study of 

25 how a patient f s genes determine his or her response to a drug (e.g., a patient's "drug 
response phenotype", or "drug response genotype".) Thus, another aspect of the 
invention provides methods for tailoring an individual's prophylactic or therapeutic 
treatment with either the hVR-1, hVR-2, and rVR-2 molecules of the present invention 
or hVR-1 , hVR-2, and rVR-2 modulators according to that individual's drug response 

30 genotype. Pharmacogenomics allows a clinician or physician to target prophylactic or 
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therapeutic treatments to patients who will most benefit fn>m the treatment and to avoid 
treatment of patients who will experience toxic drug-related side effects. 

!• Prophylactic Methods 
5 In one aspect, the invention provides a method for preventing in a subject a 

disease or condition associated with an aberrant hVR-1 , hVR-2, and rVR-2 expression 
or activity, by administering to the subject an hVR-I, hVR-2, and rVR-2 or an agent 
which modulates hVR-1 , hVR-2, and rVR-2 expression or at least one hVR-1 hVR-2 
and rVR-2 activity. Subjects at risk for a disease which is caused or contributed to by 
10 aberrant hVR-1. hVR-2, and rVR-2 expression or activity can be identified by, for 
example, any or a combination of diagnostic or prognostic assays as described herein 
Administration of a prophylactic agent can occur prior to the manifestation of symptoms 
characteristic of the hVR-1 , hVR-2, and rVR-2 aberrancy, such that a disease or disorder 
•s prevented or, alternatively, delayed in its passion. Depending on the type of hVR- 
15 1, hVR-2, and rVR-2 aberrancy, for example, an hVR-1, hVR-2, and rVR-2, hVR-1 
hVR-2, and rVR-2 agonist or hVR-1 , hVR-2, and rVR-2 antagonist agent can be used 
for treating the subject. The appropriate agent can be determined based on screening 
assays described herein. 



20 2. Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating hVR-1, hVR- 
2, and rVR-2 expression or activity for therapeutic purposes. Accordingly, in an 
exemplary embodiment, the modulatory method of the invention involves contacting a 
cell with an hVR-1 , hVR-2, and rVR-2 or agent that modulates one or more of the 
25 activities ofhVR-I, hVR-2, and rVR-2 protein activity associated with the cell An 
agent that modulates hVR-1, hVR-2, and rVR-2 protein activity can be an agent as 
described herein, such as a nucleic acid or a protein, a iiatim^ly-occurring target 
molecule of an hVR-1, hVR-2, and rVR-2 protein (e.g., an hVR-1, hVR-2, and rVR-2 
substtate), an hVR-1, hVR-2, and rVR-2 antibody, an hVR-1, hVR-2, and rVR-2 agonist 
30 or antagonist, a peptidomimetic of an hVR-1, hVR-2, and rVR-2 agonist or antagonist 
or other small molecule. In one embodiment, the agent stimulates one or more hVR- 1 ' 
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hVR-2, and rVR-2 activities. Examples of such stimulatory agents include active hVR- 

1, hVR-2, and rVR-2 protein and a nucleic acid molecule encoding hVR-1, hVR-2, and 
rVR-2 that has been introduced into the cell. In another embodiment, the agent inhibits 
one or more hVR-1 , hVR-2, and rVR-2 activities. Examples of such inhibitory agents 

5 include antisense h VR- 1 , hVR-2, and rVR-2 nucleic acid molecules, anti-h VR- 1 , h VR- 

2, and rVR-2 antibodies, and hVR-1, hVR-2, and rVR-2 inhibitors. These modulatory 
methods can be performed in vitro (e.g., by culturing the cell with the agent) or, 
alternatively, in vivo (e.g., by administering the agent to a subject). As such, the present 
invention provides methods of treating an individual afflicted with a disease or disorder 

10 characterized by aberrant expression or activity of an hVR-1, hVR-2, and rVR-2 protein 
or nucleic acid molecule. In one embodiment, the method involves administering an 
agent (e.g., an agent identified by a screening assay described herein), or combination of 
agents that modulates (e.g.* upregulates or downregulates) hVR-1, hVR-2, and rVR-2 
expression or activity. In another embodiment, the method involves administering an 

1 5 hVR- 1 , hVR-2, and rVR-2 protein or nucleic acid molecule as therapy to compensate for 
reduced or aberrant hVR-1, hVR-2, and rVR-2 expression or activity. 

Stimulation of hVR-1, hVR-2, and rVR-2 activity is desirable in situations in 
which hVR-1, hVR-2, and rVR-2 is abnormally downregulated and/or in which 
increased hVR-1, hVR-2, and rVR-2 activity is likely to have a beneficial effect. For 

20 example, stimulation of hVR- 1 , hVR-2, and rVR-2 activity is desirable in situations in 
which an hVR-1, hVR-2, and rVR-2 is downregulated and/or in which increased hVR-1, 
hVR-2, and rVR-2 activity is likely to have a beneficial effect. Likewise, inhibition of 
hVR-1, hVR-2, and rVR-2 activity is desirable in situations in which hVR-1, hVR-2, 
and rVR-2 is abnormally upregulated and/or in which decreased hVR-1, hVR-2, and 

25 rVR-2 activity is likely to have a beneficial effect. 



3. Pharmacogenomics 

The hVR-1, hVR-2, and rVR-2 molecules of the present invention, as well as 
agents, or modulators which have a stimulatory or inhibitory effect on hVR-1, hVR-2, 
30 and rVR-2 activity (e.g. , hVR-1, hVR-2, and rVR-2 gene expression) as identified by a 
screening assay described herein can be administered to individuals to treat 
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(prophylactically or therapeutically) hVR-1, hVR-2, and rVR-2-associated disorders 
(e.g., pain disorders) associated with aberrant hVR-1, hVR-2, and rVR-2 activity. In 
conjunction with such treatment, pharmacogenomics (i.e., the study of the relationship 
between an individual's genotype and that individual's response to a foreign compound 
or drug) may be considered. Differences in metabolism of therapeutics can lead to 
severe toxicity or therapeutic failure by altering the relation between dose and blood 
concentration of the pharmacologically active drug. Thus, a physician or clinician may 
consider applying knowledge obtained in relevant pharmacogenomics studies in 
determining whether to administer an hVR-1, hVR-2, and rVR-2 molecule or hVR-1, 
hVR-2, and rVR-2 modulator as well as tailoring the dosage and/or therapeutic regimen 
of treatment with an hVR-1, hVR-2, and rVR-2 molecule or hVR-1, hVR-2, and rVR-2 
modulator. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected 
persons. See, for example, Eichelbaum, M. et al (1996) Clin. Exp. Pharmacol Physiol 
23(10-1 1) :983-985 and Under, M.W. et al. (1997) Clin. Chem. 43(2):254-266. In 
general, two types of pharmacogenetic conditions can be differentiated. Genetic 
conditions transmitted as a single factor altering the way drugs act on the body (altered 
drug action) or genetic conditions transmitted as single factors altering the way the body 
acts on drugs (altered drug metabolism). These pharmacogenetic conditions can occur 
either as rare genetic defects or as naturally-occurring polymorphisms. For example, 
glucose-6-phosphate dehydrogenase deficiency (G6PD) is a common inherited 
enzymopathy in which the main clinical complication is haemolysis after ingestion of 
oxidant drugs (anti-malarials, sulfonamides, analgesics, nitrofurans) and consumption of 
fava beans. 

One pharmacogenomics approach to identifying genes that predict drug 
response, known as "a genome-wide association", relies primarily on a high-resolution 
map of the human genome consisting of already known gene-related markers (e.g., a "bi- 
allelic" gene marker map which consists of 60,000-100,000 polymorphic or variable 
sites on the human genome, each of which has two variants.) Such a high-resolution 
genetic map can be compared to a map of the genome of each of a statistically 
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significant number of patients taking part in a Phase II/III drug trial to identify markers 
associated with a particular observed drug response or side effect. Alternatively, such a 
high resolution map can be generated from a combination of some ten-million known 
single nucleotide polymorphisms (SNPs) in the human genome. As used herein, a 
"SNP" is a common alteration that occurs in a single nucleotide base in a stretch of 
DNA. For example, a SNP may occur once per every 1000 bases of DNA. A SNP may 
be involved in a disease process, however, the vast majority may not be disease- 
associated. Given a genetic map based on the occurrence of such SNPs, individuals can 
be grouped into genetic categories depending on a particular pattern of SNPs in their 
individual genome. In such a manner, treatment regimens can be tailored to groups of 
genetically similar individuals, taking into account traits that may be common among 
such genetically similar individuals. 

Alternatively, a method termed the "candidate gene approach", can be utilized to 
identify genes that predict drug response. According to this method, if a gene that 
encodes a drugs target is known (e.g., an hVR-1, hVR-2, and rVR-2 protein of the 
present invention), all common variants of that gene can be fairly easily identified in the 
population and it can be determined if having one version of the gene versus another is 
associated with a particular drug response. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 
major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 
2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation 
as to why some patients do not obtain the expected drug effects or show exaggerated 
drug response and serious toxicity after taking the standard and safe dose of a drug. 
These polymorphisms are expressed in two phenotypes in the population, the extensive 
metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among 
different populations. For example, the gene coding for CYP2D6 is highly polymorphic 
and several mutations have been identified in PM, which all lead to the absence of 
functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite frequently 
experience exaggerated drug response and side effects when they receive standard doses. 
If a metabolite is the active therapeutic moiety, PM show no therapeutic response, as 
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demonstrated for the analgesic effect of codeine mediated by its C YP2D6-formed 
metabolite morphine. The other extreme are the so called ultra-rapid metabolizers who 
do not respond to standard doses. Recently, the molecular basis of ultra-rapid 
metabolism has been identified to be due to CYP2D6 gene amplification. 

Alternatively, a method termed the "gene expression profiling", can be utilized to 
identify genes that predict drug response. For example, the gene expression of an 
animal dosed with a drug (e.g., an hVR-1, hVR-2, and rVR-2 molecule or hVR-1, hVR- 
2, and rVR-2 modulator of the present invention) can give an indication whether gene 
pathways related to toxicity have been turned on. 

Information generated from more than one of the above pharmacogenomics 
approaches can be used to determine appropriate dosage and treatment regimens for 
prophylactic or therapeutic treatment an individual. This knowledge, when applied to 
dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus 
enhance therapeutic or prophylactic efficiency when treating a subject with an hVR-1, 
hVR-2, and rVR-2 molecule or hVR-1, hVR-2, and rVR-2 modulator, such as a 
modulator identified by one of the exemplary screening assays described herein. 

This invention is further illustrated by the following examples which should not 
be construed as limiting. The contents of all references, patents and published patent 
applications cited throughout this application, as well as the Figures and the Sequence 
Listing are incorporated herein by reference. 

EXAMPLES 

EXAMPLE 1: IDENTIFICATION AND CHARACTERIZATION 

OF hVR-1, hVR-2, and rVR-2 cDNA 

In this example, the identification and characterization of the genes encoding 
hVR-1 (clone Fchrb87a6), hVR-2 (clone flh21el 1), hVR-2 alternate form (clone 
frhobl2c4), and rVR-2 (clone flrxb!47gl 1) are described. 



WO 00/29577 PCIYUS99/26701 

-89- 

Isolation of the hVR-h hVR-2, and the rVR-2 cDNA 

The invention is based, at least in part, on the discovery of two human genes and 
one rat gene encoding novel members of the Capsaicin/Vanilloid receptor family, 
referred to herein as hVR-1, hVR-2, and rVR-2, respectively. These clones were 
identified from a human heart library and a rat dorsal root ganglion (DRG) library, based 
on sequence homology to the known rat VR-1 (Accession Number AF029310). The 
sequence of the two human clones and the rat clone was determined and found to 
contain open reading frames. 

The nucleotide sequence of the full length hVR-1 cDNA and the predicted amino 
acid sequence of the hVR-1 polypeptide are shown in Figure 1 and in SEQ ID NOs: 1 
and 2, respectively. 

The nucleotide sequence of the full length hVR-2 cDNA and the predicted amino 
acid sequence of the hVR-2 polypeptide are shown in Figure 2 and in SEQ ID NOs:4 
and 5, respectively. 

The nucleotide sequence of the partial hVR-2 (alternate form) cDNA and the 
predicted amino acid sequence of the hVR-2 (alternate form) polypeptide are shown in 
Figure 3 and in SEQ ID NOs:7 and 8, respectively. 

The amino acid sequence of the predicted full length human VR-2 protein 
(alternate form) is shown in Figure 16 and in SEQ ID NO:20. 

The nucleotide sequence of the partial rVR-2 cDNA and the predicted amino 
acid sequence of the rVR-2 polypeptide are shown in Figure 4 and in SEQ ID NOs: 10 
and Irrespectively. 

Analysis of the hVR-1, hVR-2, and rVR-2 Molecules 

The hVR-1 protein (SEQ ID NO:2) was aligned with the human VR-2 protein 
(SEQ ID NO:5) using the GAP program in the GCG software package (Blosum 62 
matrix) and a gap weight of 12 and a length weight of 4. The results showed a 46.348% 
identity and 55.378% similarity between the two sequences (see Figure 5). 

The hVR-1 nucleotide sequence (SEQ ID NO:l) was aligned with the human 
VR-2 nucleotide sequence (SEQ ID NO:4) using the GAP program in the GCG software 
package (nwsgapdna matrix) and a gap weight of 50 and a length weight of 3. The 



WO 00/29577 



-90- 



PCT7US99/26701 



results showed a 55.316% identity and 55.316% similarity between the two sequences 
(see Figure 6). 

The hVR-2 protein (SEQ ID NO:5) was aligned with the rat VR-2 protein (SEQ 
ID NO:l 1) using the CLUSTAL W (1 .74) multiple sequence alignment program (Figure 
5 7), as well as using the GAP program in the GCG software package (Blosum 62 matrix) 
and a gap weight of 12 and a length weight of 4. The results showed a 79.167% identity 
and 81.703% similarity between the two sequences (see Figure 8). 

The hVR-1 nucleotide sequence (SEQ ID NO:l) was aligned with the rat VR-1 
nucleotide sequence (Accession Number: AF0293 10) using the GAP program in the 
10 GCG software package (nwsgapdna matrix) and a gap weight of 50 and a length weight 
of 3. The results showed a 82.125% identity and 82.125% similarity between the two 
sequences (see Figure 9). 

The hVR-1 protein (SEQ ID NO:2) was aligned with the rat VR-1 protein 
(Accession Number: AF0293 10) using the GAP program in the GCG software package 
1 5 (Blosum 62 matrix) and a gap weight of 12 and a length weight of 4. The results showed 
a 86.022% identity and 89.247% similarity between the two sequences (see Figure 10). 

The hVR-2 protein (SEQ ID NO:5) was aligned with the human VR-2 protein 
(alternate form) (SEQ ID NO:8) using the CLUSTAL W (1 .74) multiple sequence 
alignment program (Figure 1 1). 
20 Finally, the hVR-2 protein (SEQ ID NO:5) was aligned with the predicted full 

length human VR-2 protein (alternate form) (SEQ ID NO:20) using the CLUSTAL W 
(1.74) multiple sequence alignment program (Figure 17). 

A search was performed against the HMM database resulting in the identification 
of three ankyrin repeat domains in the amino acid sequence of h VR-1 (SEQ ID NO:2) at 
25 about residues 201-233, 248-283, and 333-361, and in the amino acid sequence of hVR- 
2 (SEQ ID NO:5) at about residues 162-194, 208-243, and 293-328. The results of the 
searches are set forth in Figures 13 and 15, respectively. 

Hydropathy plots have identified 6 transmembrane domains in the hVR-1 and 
the hVR-2 proteins (see Figures 12 and 14, respectively). 
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A series of searches have revealed that the hVR-1 protein matches the ProDom 
entry 141801 for the vanilloid receptor subtype and the ProDom entry 145518 for the 
vanilloid receptor subtype. 

Moreover, a search was performed against the Prosite database resulting in the 
identification of four N-glycosylation sites in the amino acid sequence of SEQ ID NO:5 
(at about residues 171-174, 192-195, 604-607, and 749-752), three cGMP-dependent 
protein kinase phosphorylation sites in the amino acid sequence of SEQ ID NO:5 (at 
about residues 2-5, 368-371, and 499-502), a series of protein kinase C and Casein 
kinase II phosphorylation sites in the amino acid sequence of SEQ ID NO:5, two 
tyrosine kinase phosphorylation sites in the amino acid sequence of SEQ ID NO:5 (at 
about residues 368-375 and 622-628), and two myristoylation sites in the amino acid 
sequence of SEQ ID NO:5 (at about residues 169-174 and 765-770). 

Tissue Distribution of hVR-1 and hVR-2 mRNA 

This Example describes the tissue distribution of hVR-1 and hVR-2 mRNA as 
determined by in situ hybridization. 

For in situ analysis, tissues, such as brain regions and whole brain, obtained from 
human and monkey were first frozen on dry ice. Ten-micrometer-thick coronal sections 
of the tissues were postfixed with 4% formaldehyde in DEPC treated IX phosphate- 
buffered saline at room temperature for 10 minutes before being rinsed twice in DEPC 
IX phosphate-buffered saline and once in 0.1 M triethanolamine-HCl (pH 8.0). 
Following incubation in 0.25% acetic anhydride-0.1 M triethanolamine-HCl for 10 
minutes, sections were rinsed in DEPC 2X SSC (IX SSC is 0.15M NaCl plus 0.01 5M 
sodium citrate). Tissue was then dehydrated through a series of ethanol washes, 
incubated in 100% chloroform for 5 minutes, and then rinsed in 100% ethanol for 1 
minute and 95% ethanol for 1 minute and allowed to air dry. 

Hybridizations were performed with 35 S-radiolabeled (5 X 10 7 cpm/ml) cRNA 
probes. Probes were incubated in the presence of a solution containing 600 mM NaCl, 
10 mM Tris (pH 7.5), 1 mM EDTA, 0.01% sheared salmon sperm DNA, 0.01% yeast 
tRNA, 0.05% yeast total RNA type XI, 1 X Denhardt's solution, 50% formamide, 10% 
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dextran sulfate, 100 mM dithiothreitol, 0.1% sodium dodecyl sulfate (SDS), and 0.1% 
sodium thiosulfate for 1 8 hours at 55°C. 

After hybridization, slides were washed with 2 X SSC. Sections were then 
sequentially incubated at 37°C in TOE (a solution containing 10 mM Tris-HCl <pH 7.6) 
5 500 mM NaCl, and 1 mM EDTA), for 10 minutes, in TNE with l 0fl g of RNase A per ml 
for 30 minutes, and finally in TNE for 10 minutes. Slides were then rinsed with 2 X 
SSC at room temperature, washed with 2 X SSC at 50°C for 1 hour, washed with 0.2 X 
SSC at 55°C for 1 hour, and 0.2 X SSC at 60°C for 1 hour. Sections were then 
dehydrated rapidly through serial ethanol-0.3 M sodium acetate concentrations before 
1 0 being air dried and exposed to Kodak Biomax MR scientific imaging film for 24 hours 
and subsequently dipped in NB-2 photoemulsion and exposed at 4°C for 7 days before 
being developed and counter stained. 

The data indicate that the hVR-1 molecule is not expressed in human nor 
monkey brain. The hVR-1 molecule is expressed in nodose, trigeminal sensory neurons 
1 5 but „ not expressed in sympathetic neurons. Within the nodose sensory neurons and 
trigeminal sensory neurons, expression was seen in distinct sub-populations. Moreover 
hVRl is expressed in some, but not all, small dorsal root ganglion (ORG) neurons and in 
a few medium sized ORG neurons. The hVR-1 mo ,ecule is partially co-expressed with 
the neuropeptide CGRP and with substance P which are present in nociceptive neurons 
20 The data further indicate that the VR-2 molecule is expressed in both human and 

monkey brain, primarily in cortical neurons. The VR2 molecule is also expressed in 
other brain regions, for example, the thalamus, striatum, hippocampus, hypothalamus, 
midbrain, medula and brain stem. In addition, the VR-2 molecule is expressed in 
parasympathetic neurons of the monkey heart (atrium), nodose sensory neurons, 
25 trigeminal (TRG) sensory neurons, dorsal root gangJion sensory neurons, sympathetic 
neurons, and motor neurons of the spinal cord. The VR2 molecule is widely expressed 
in TRG and ORG neurons, being present in most small and medium sized neurons and 
also in a few of the large neurons. VR2, like VR-1 , partially co-localizes with CGRP 
and substance P. 
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Trigeminal sensory neurons are recognized pain centers while sympathetic 
neurons have been shown to be involved in neuropathic pain. 

EXAMPLE 2: EXPRESSION OF RECOMBINANT hVR-1, bVR-2, AND 
5 rVR-2 PROTEIN IN BACTERIAL CELLS 

In this example, hVR-1, hVR-2, and rVR-2 is expressed as a recombinant 
glutathione-S -transferase (GST) fusion polypeptide in E. coli and the fusion polypeptide 
is isolated and characterized. Specifically, hVR- 1 , hVR-2, and rVR-2 is fused to GST 
and this fusion polypeptide is expressed in E. coli, e.g., strain PEB199. Expression of 
1 0 the GST-hVR- 1 , GST-hVR-2, and GST-rVR-2 fusion protein in PEB 1 99 is induced 
with IPTG. The recombinant fusion polypeptide is purified from crude bacterial lysates 
of the induced PEB 1 99 strain by affinity chromatography on glutathione beads. Using 
polyacrylamide gel electrophoretic analysis of the polypeptide purified from the 
bacterial lysates, the molecular weight of the resultant fusion polypeptide is determined. 

15 

EXAMPLE 3: EXPRESSION OF RECOMBINANT hVR-1, hVR-2, 
AND rVR-2 PROTEIN IN COS CELLS 

To express the hVR-1 , hVR-2, and rVR-2 gene in COS cells, the pcDN A/Amp 
vector from Invitrogen Corporation (San Diego, CA) is used. This vector contains an 

20 SV40 origin of replication, an ampicillin resistance gene, an E. coli replication origin, a 
CMV promoter followed by a polylinker region, and an SV40 intron and 
polyadenylation site. A DNA fragment encoding the entire hVR-1, hVR-2, and rVR-2 
protein and an HA tag (Wilson et al (1 984) Cell 37:767) or a FLAG tag fused in-frame 
to its 3 ! end of the fragment is cloned into the polylinker region of the vector, thereby 

25 placing the expression of the recombinant protein under the control of the CMV 
promoter. 

To construct the plasmid, the hVR-1, hVR-2, and rVR-2 DNA sequence is 
amplified by PCR using two primers. The 5' primer contains the restriction site of 
interest followed by approximately twenty nucleotides of the hVR-1 , hVR-2, and rVR-2 
30 coding sequence starting from the initiation codon; the 3' end sequence contains 
complementary sequences to the other restriction site of interest, a translation stop 
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codon, the HA tag or FLAG tag and the last 20 nucleotides of the hVR-1, hVR-2, and 
rVR-2 coding sequence. The PGR amplified fragment and the pCDNA/Amp vector are 
digested with the appropriate restriction enzymes and the vector is dephosphorylated 
using the CIAP enzyme (New England Biolabs, Beverly, MA). Preferably the two 
restriction sites chosen are different so that the hVR-1, hVR-2, and rVR-2 gene is 
inserted in the correct orientation. The ligation mixture is transformed into £ coli cells 
(strains HB101, DH5a, SURE, available from Stratagene Cloning Systems, La Jolla, 
CA, can be used), the transformed culture is plated on ampicillin media plates, and 
resistant colonies are selected. Plasmid DNA is isolated from transformants and 
examined by restriction analysis for the presence of the correct fragment 

COS cells are subsequently transfected with the hVR-1, hVR-2, and rVR-2- 
pcDNA/Amp plasmid DNA using the calcium phosphate or calcium chloride co- 
precipitation methods, DEAE-dextran-mediated transfection, lipofection, or 
electroporation. Other suitable methods for transfecting host cells can be found in 
Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A Laboratory Manual 
2nd, ed f Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, 1989. The expression of the hVR-1, hVR-2, and rVR-2 polypeptide 
is detected by radiolabelling ( 35 S-methionine or 35 S-cysteine available from NEN, 
Boston, MA, can be used) and immunoprecipitation (Harlow, E. and Lane, D. 
Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, 1988) using an HA specific monoclonal antibody. Briefly, the cells are 
labelled for 8 hours with 35 S-methionine (or 35 S-cysteine). The culture media are then 
collected and the cells are lysed using detergents (RIPA buffer, 150 mM NaCl, 1% NP- 
40, 0.1% SDS, 0.5% DOC, 50 mM Tris, pH 7.5). Both the cell lysate and the culture 
media are precipitated with an HA specific monoclonal antibody. Precipitated 
polypeptides are then analyzed by SDS-PAGE. 

Alternatively, DNA containing the hVR-1 , hVR-2, and rVR-2 coding sequence 
is cloned directly into the polylinker of the pCDNA/Amp vector using the appropriate 
restriction sites. The resulting plasmid is transfected into COS cells in the manner 
described above, and the expression of the hVR-1, hVR-2, and rVR-2 polypeptide is 
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detected by radiolabelling and immunoprecipitation using an hVR-1, hVR-2, and rVR-2 
specific monoclonal antibody. 

EXAMPLE 4: ELECTROPHYSIOLOGICAL STUDIES OF VR2 

5 Human VR2 was functionally characterized in both HEK293 cells and Xenopus 

oocytes using electrophysiological methods. VR2 (in the pcDNA3.1 vector purchased 
by Invitrogen) was transiently expressed in HEK293 cells (ATCC) and recordings were 
performed 48 hours after transfection of cells using the whole-cell patch-clamp method 
(described in Bertil Hille, Ionin Channels of excitable membranes, 1992; Hammill et al 

10 (1981) Pluger Arch. 391:85-100). The results indicate that heat stimulation (>50 °C) 
induces a rapid inactivating inward current (1-2 nA). Heat-evoked currents of VR2 
displayed profound desensitization and could be reversibly blocked by the VR1 inhibitor 
capsazepin (at a 10 \iM concentration). In contrast to rat VR1, Capsaicin (at a 1-10 |iM 
concentration), resiniferatoxin (at a 0.1-3 concentration), and low pH (5.0-6.0) do 

1 5 not induce any currents from VR2. Binding studies of [ 3 H]-resiniferatoxin (NEN) to 
both human VR1 and VR2 in membranes isolated from HEK293 cell homogenates also 
indicate that resiniferatoxin (at a 0.1-10 nM concentration) has no specific binding to 
VR2 while it binds to human VR1 with high affinities. 

For the oocyte studies, human VR2 was subcloned into an oocyte expression 

20 vector containing 5 and 3 '-UTR of Xenopus p-globin (Chiara et al ( 1 999) 

Biochemistry 38(20)6689-6698). In vitro trasncription was carried out as described in 
Chiara et al {supra) and cRNA (10-100 ng) was then injected into the oocytes. VR2 
function was characterized in the oocytes 48 hours after cRNA injection using a standard 
two-electrode voltage-clamp. Consistent with the data from the HEK293 studies, VR2 

25 can only be activated by heat stimulation (48-50 °C) but not by vanilloid receptor 

agonists, capsaicin, or resiniferatoxin. The vanilloid receptor antagonist capsazepine (at 
a 1-10 fiM concentration) blocks the heat response of VR2 reversibly. 



10 
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EXAMPLE 5: GENERATION OF ANTI-hVR-2 ANTIBODIES AND hVR-2 
PROTEIN LOCALIZATION BY IMMUNOSTAINING 

Polyclonal antisera were raised in rabbits against the following three peptides 
derived from the human VR2 amino acid sequence, using the techniques described in Ed 
Harlow and David Lane (1988) "Antibodies; A Laboratory Manual" Cold Spring harbor 
Laboratory Press. 

Antibody PEPTIDE 1: AFHCKSPHRHRMVVLE (SEQ ID NO:13) 
Antibody PEPTIDE 2: RPEAPTGPNATESVQPMEGQEDEGN (SEQ ID N0.14) 
Antibody PEPTIDE 3: SVLEMENGYWWCRKKQRAG (SEQ ID N0.15) 
Antisera were subsequently affinity purified using the peptide immunogen. 

The polyclonal antisera were tested for immunostaining of both monkey and rat 
dorsal root ganglion sensory neurons. Peptides 1 and 3 gave specific staining of 
subpopulations of sensory neurons that was competed with the corresponding peptide. 
This pattern of expression was very similar to the one observed using a VR-2 riboprobe. 

EXAMPLE 6: CHROMOSOMAL LOCALIZATION OF hVR-1 AND 
hVR-2 

To chromosomally map the hVR-1 ge „ e , primers were designed based on the 
sequence of hVR-1 (clone Fchrb87a6) (amplifying a 1 77 bp product from a human 
20 control cell line DNA and multiple faint larger products from a control Hamster cell line 
DNA by PCR). These primers were used to amplify 93 DNAs in duplicate from the 
Genebridge 4 Radiation Hybrid Panel (Research Genetics, Inc., Huntsville, AL). 

The hVR-1 primers used in the PCR mapping studies were: forward - 
TAGGAGACCCCGTTGCCACG (SEQ ID NO:16) and reverse - 
25 GATTCACTTGGGGACAGTGACG (SEQ ID NO:17) and the PCR reactions were 
performed as follows: 5 M I Template DNA (lOng/ul), 1.5ul 10X Perkin Elmer PCR 
Buffer, 1.2ul Pharmacia dNTP mix 2.5 mM, 1.15ul Forward primer 6.6uM, 1.15ul 
Reverse primer 6.6uM, 5ul Gibco/BRL Platinum Taq .05U/ul (Hot Start), using an 
amplification profile of: 95°C for 10 minutes followed by 35 Cycles of 94'C for 40 
30 seconds, 55°C for 40 seconds, 72°C for 40 seconds, and 72'C for 5 minutes. The PCR 
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products were run on 2% agarose gels, post-stainedwith SYBR Gold (1:1 0,000 dilution 
in IX TBE), and scanned on a Molecular Dynamics 595 Fluorimager. 

The following is the vector data for the 93 Genebridge4 hybrid DNAs. These are 
in order 1-93. A "1" is a positive result, a is a negative result, a is an ambiguous 
result. 

hVRl 1 --17-1-1-1 1 — 1 — 1-1 1--1-1-1--1 1-1 1-1 1 1 

1 — I 1 1 1-1--1 1 1-1-1 — 1 l-i ---1 

RH linkage analysis was performed using the Map Manager QTb28 software package. 

hVRl was found to map to the p arm of human chromosome 17, 18.9 cR 3000 
telomeric to the Whitehead Institute framework marker WI-6584, and 7.7 cR 3000 
centromeric of the Whitehead framework marker WI-5436. LOD scores for linkage 
were 14.5 for WI-6584 and 19.3 for WI-5436. This region corresponds to the 
cytogenetic location 17pl2-13. This region is syntenic to mouse chromosome 11. 

To chromosomal ly map the hVR-2 gene, primers were designed from 5' UTR 
sequence of human VR2 (clone Flh21el 1) (amplifying a 166 bp product from a human 
control cell line DNA and 2 much larger faint bands from a control Hamster cell line 
DNA by PCR). These primers were used to amplify 93 DNAs in duplicate from the 
Genebridge 4 Radiation Hybrid Panel (Research Genetics, Inc., Huntsville, AL). 

The hVR-2 primers used in the PCR mapping studies were: forward - 
TTAAGCTCCCGTTCTCACCG (SEQ ID NO: 18) and reverse - 
GCTGCGGGAGGAAGTGAAGC (SEQ ID NO: 19) and the PCR reactions were 
performed as follows: 5nl Template DNA (lOng/jil), 1.5^1 10X Perkin Elmer PCR 
Buffer, l.2\il Pharmacia dNTP mix 2.5mM, 1.15^1 Forward primer 6.6nM, 1.15^x1 
Reverse primer 6.6nM, 5\il Gibco/BRL Platinum Taq .05U/^1 (Hot Start), using an 
amplification profile of 95°C for 10 minutes, followed by 35 Cycles of 94°C for 40 
seconds, 55°C for 40 seconds, 72°C for 40 seconds, and 72°C for 5 minutes. The PCR 
products were run on 2% agarose gels, post-stainedwith SYBR Gold (1:1 0,000 dilution 
in IX TBE), and scanned on a Molecular Dynamics 595 Fluorimager. 
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are 



The following is the vector data for the 93 Genebridge4 hybrid DNAs. These 
in order 1-93. A "1" is a positive result, a is a negative result, a "?" is an ambiguous 



result. 



hVR2 1--H-711 1-1-11— -1-1111-1 11-111-- 1 

1--11 1 J 1 1-1 LI-] J 1 J J...J 1-1.1.9...? 

RH linkage analysis was performed using the Map Manager QTb28 software package. 



0000 
-3000 



hVR2 was found to map to the p arm of human chromosome 1 7 , 29.3cR cR 
telomeric to the Whitehead Institute framework marker D 1 7S72 1 , and 23.3 cR 
cemromeric of the Whitehead framework marker AFMA0432B5. LOD scores for 
linkage were 11.9 for D17S721 and 13.6 for AFMA043ZB5. This region corresponds to 
the cytogenetic location 17pl 1-12. This region is syntenic to mouse chromosome 11. 

Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following 

claims. 
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1 . An isolated nucleic acid molecule selected from the group consisting of: 

(a) a nucleic acid molecule comprising the nucleotide sequence set 
5 forth in SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12 or a complement thereof; and 

(b) a nucleic acid molecule consisting of the nucleotide sequence set 
forth in SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12 or a complement thereof. 

2. An isolated nucleic acid molecule which encodes a polypeptide selected 
1 0 from the group consisting of: 

(a) a polypeptide comprising the amino acid sequence set forth in 
SEQ ID NO:2, 5, 8, or 11; and 

(b) a polypeptide consisting of the amino acid sequence set forth in 
SEQIDNO:2,5,8,orll. 

15 

3. An isolated nucleic acid molecule which encodes a naturally occurring 
allelic variant of a polypeptide comprising the amino acid sequence set forth in SEQ ID 
NO:2, 5, 8, or 11. 



20 4. An isolated nucleic acid molecule selected from the group consisting of: 

a) a nucleic acid molecule comprising a nucleotide sequence which 
is at least 83% identical to the nucleotide sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 
12, or a complement thereof; 

b) a nucleic acid molecule comprising a fragment of at least 20 

25 nucleotides of a nucleic acid comprising the nucleotide sequence of SEQ ID NO: 1,3,4, 
6, 7, 9, 10, or 12, or a complement thereof; 

c) a nucleic acid molecule which encodes a polypeptide comprising 
an amino acid sequence at least about 87% identical to the amino acid sequence of SEQ 
IDNO:2, 5, 8,orll;and 
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d) a nucleic acid molecule which encodes a fragment of a 
polypeptide comprising the amino acid sequence of SEQ ID NO:2, 5, 8, or 1 1, wherein 
the fragment comprises at least 15 contiguous amino acid residues of the amino acid 
sequence of SEQ ID NO:2, 5, 8, or 1 1. 



5. An isolated nucleic acid molecule comprising the nucleic acid molecule 
of any one of claims 1 , 2, 3, or 4, and a nucleotide sequence encoding a heterologous 
polypeptide. 



6. A vector comprising the nucleic acid molecule of any one of claims 1, 2, 

3, or 4. 

7. The vector of claim 6, which is an expression vector. 

8. A host cell transfected with the expression vector of claim 7. 



9. A method of expressing a polypeptide comprising culturing the host cell 
of claim 8 in an appropriate culture medium to, thereby, express the polypeptide. 
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10. An isolated polypeptide selected from the group consisting of: 

a) a fragment of a polypeptide comprising the amino acid sequence 
of SEQ ID NO:2, 5, 8, or 1 1, wherein the fragment comprises at least 15 contiguous 
amino acids of SEQ ID NO:2, 5, 8, or 1 1 ; 
5 b) a naturally occurring allelic variant of a polypeptide comprising 

the amino acid sequence of SEQ ID NO:2, 5, 8, or 1 1, wherein the polypeptide is 
encoded by a nucleic acid molecule which hybridizes to a nucleic acid molecule 
consisting of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12 under stringent conditions; 

c) a polypeptide which is encoded by a nucleic acid molecule 
10 comprising a nucleotide sequence which is at least 83% identical to a nucleic acid 

comprising the nucleotide sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12; and 

d) a polypeptide comprising an amino acid sequence which is at least 
87% identical to the amino acid sequence of SEQ ID NO:2, 5, 8, or 1 1. 

15 11. The isolated polypeptide of claim 1 0 comprising the amino acid sequence 

ofSEQIDNO:2,5,8,or II. 

12. The polypeptide of claim 10, further comprising heterologous amino acid 
sequences. 

20 

13. An antibody which selectively binds to a polypeptide of claim 1 0. 

14. A method for detecting the presence of a polypeptide of claim 1 0 in a 
sample comprising: 

25 a) contacting the sample with a compound which selectively binds to 

the polypeptide; and 

b) determining whether the compound binds to the polypeptide in 
the sample to thereby detect the presence of a polypeptide of claim 10 in the sample. 



30 



15. The method of claim 14, wherein the compound which binds to the 
polypeptide is an antibody. 
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16. A kit comprising a compound which selectively binds to a polypeptide of 
claim 10 and instructions for use. 

17. A method for detecting the presence of a nucleic acid molecule of any 
one of claims 1, 2, 3, or 4 in a sample comprising: 

a) contacting the sample with a nucleic acid probe or primer which 
selectively hybridizes to the nucleic acid molecule; and 

b) determining whether the nucleic acid probe or primer binds to a 
nucleic acid molecule in the sample to thereby detect the presence of a nucleic acid 
molecule of any one of claims 1, 2, 3, or 4 in the sample. 

1 8. The method of claim 1 7, wherein the sample comprises mRN A 
molecules and is contacted with a nucleic acid probe. 

19. A kit comprising a compound which selectively hybridizes to a nucleic 
acid molecule of any one of claims 1, 2, 3, or 4 and instructions for use. 

20. A method for identifying a compound which binds to a polypeptide of 
claim 10 comprising: 

a) contacting the polypeptide, or a cell expressing the polypeptide 
with a test compound; and 

b) determining whether the polypeptide binds to the test compound. 

21 . The method of claim 20, wherein the binding of the test compound to the 
polypeptide is detected by a method selected from the group consisting of: 

a) detection of binding by direct detection of test 
compound/polypeptide binding; 

b) detection of binding using a competition binding assay; and 

c) detection of binding using an assay for hVR- 1 , h VR-2, or rVR-2 

activity. 
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22. A method for modulating the activity of a polypeptide of claim 10 
comprising contacting the polypeptide or a cell expressing the polypeptide with a 
compound which binds to the polypeptide in a sufficient concentration to modulate the 
activity of the polypeptide. 

5 

23. A method for identifying a compound which modulates the activity of a 
polypeptide of claim 10 comprising: 

a) contacting a polypeptide of claim 1 0 with a test compound; 'and 

b) determining the effect of the test compound on the activity of the 
1 0 polypeptide to thereby identify a compound which modulates the activity of the 

polypeptide. 

24. A method for treating a subject having a disorder characterized by 
aberrant hVR-1 or hVR-2 protein activity or nucleic acid expression comprising 

1 5 administering to the subject a hVR-1 or hVR-2 modulator such that treatment of the 
subject occurs. 

25. The method of claim 24, wherein the hVR-1 or hVR-2 modulator is a 
small molecule. 

20 

26. The method of claim 24, wherein the disorder is a pain disorder. 
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humanVRl gene with translation of open reading frame 

input file Fchrb67a6.«eo; Output Pile Fchrb87a6. tra 
Sequence length 3909 

<***^<™*>™^^ 
Oa^OTSAOCGCAAOKaCTOCGOCKa 

TTTATGCTTCCGGCTCGTATGTTGTGTGGAArrCn^GCGGAT^ 

ACGCC^GCTCTAATACGACTCACTATACK^AAAGCTGGTACGCCTGCA^A 
AOOCGTCCGAAAACACACCTCTCIX^^ 

<n«CICTG<^GAATTC^^ 
CCMAMUGCTGTCCACACTAGTCXXKCC^^ 

OOTCTGCCaATATTAAATOQAAAATTCTI^AAACAGTTCCCAAGTTTTCCCTTGTO 

AaTCACTTAGTam^TCOCCCeTCCTOOTATra 

MKKWSSTDLGTAAnt>T ^ 
GCAAGG ATG AAG AAA TOG AGC ASC ACA GAC TTO GOG ACA OCT 000 GAC CCA CTC CAA AAO S4 

DTCp nPLDODPH8Rppt>»»„ 

_ ^ ACC w ca^eoccTOaw^^cciMciccMGcaccrcaQww dl 

A< "° *^ CQC AOC caa TTT GOG AAQ GOT OAC TCG GAG GAG OCX 174 

JLJL VDCPHJt8< » 8 l'»*CPTITV 7B 
TTC CCG OTG GAT TGC OCC CAC GAG GAA GOT GAG TTG GAC TCC TGC COG ACC ATC ACA GTC 234 

* PVI *XQ*FGDOPT<j A « 1<I , s M 
AOC CCT OTT ATC ACC ATC CAG AGG CCA GGA GAC GGC OCC ACC GGT GCC AGG CTG CT0 TCC 294 

«?„-? 8 VA ASTB,C Tt *I"VDRRfil 118 
CAG OAC TCT OTC OCC OCC ASC AOC GAG AAO AOC CTC AGO CIC TAT OAT OQC AGO AST ATC 354 

r BAV sAQKHCQDLS8LI,LPl,0 13fl 
TTT GAA OCC OTT OCT CAG AAT AAC TGC CAS GAT CTO GAG ASC CTG CT8 CTC TTC CTO CAG «4 
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KSKKHLTDNEFKDPBTOKT C 158 
AAG AGC AAG AAG CAC CTC ACA GAC AAC GAG TTC AAA GAC CCT GAG ACA GGG AAG ACC TGT 474 



LLKAMLNLHDGQKTTIPLLL 176 

CTG CTG AAA GCC ATG CTC AAC CTG CAC GAC GGA CAG AAC ACC ACC ATC CCC CTG CTC CTG 534 

EIARQTDSLKELVNASYTD 6 198 

GAG ATC GCG CGG CAA ACG GAC AGC CTG AAG GAG CTT GTC AAC GCC AGC TAC ACG GAC AGC 594 

YYKGQTAIjR IAIBRRNMALV 218 

TAC TAC AAG GGC CAG ACA GCA CTG CAC ATC GCC ATC GAG AGA CGC AAC ATG GCC CTG GTG 654 

TLLVEKGADVQAAAHGDFFK 238 

ACC CTC CTG GTG GAG AAC GGA GCA GAC GTC CAG GCT GCG GCC CAT GGG GAC TTC TTT AAG 714 

KTKGRPGFYFGBLPLSLAAC 258 

AAA ACC AAA GGG CGG CCT GGA TTC TAC TTC GGT GAA CTG CCC CTG TCC CTG GCC GCG TGC 774 

TNQLGIVKFLLQNfiMQ TADI 278 

ACC AAC CAG CTG GGC ATC GTG AAG TTC CTG CTG CAG AAC TCC TGG CAG ACG GCC GAC ATC 834 

SARDSVGNTVLHALVBVADN 298 

AGC GCC AGC GAC TOG GTG GGC AAC ACG GTG CTG CAC GCC . CTG GTG GAG GTG GCC GAC AAC 894 

TADNTKFVTSMYWEILM L GA 318 

ACG GCC GAC AAC ACG AAG TTT GTG ACG AGC ATG TAC AAT GAG ATT CTG ATG CTG GGG GCC .954 

KLHPTLKLBBLTM KKGMTPL 338 

AAA CTG CAC COG ACG CTG AAG CTG GAG GAG CTC ACC AAC AAG AAG GGA ATG ACG COG CTG 1014 

ALAAGTGKXGVLAYILQRBI 358 

GCT CTG GCA GCT GGG ACC GGG AAG ATC GGG GTC TTG GCC TAT ATT CTC CAG CGG GAG ATC 1074 

Q B P E C R H L S RKFT B HAYG PV 378 

CAG GAG CCC GAG TGC AGG CAC CTG TCC AGO AAG TTC ACC GAG TGG GCC TAC GGG CCC GTG 1134 

H s 5 L YDlffi C I D T C B KNSVLB 398 

CAC TCC TOG CTG TAC GAC CTG TCC TGC ATC "GAC ACC TGC GAG AAG AAC TOG GTG CTG GAG 1194 

VIAYSB6BTPHRHDMLLVBP 418 

GTG ATC GCC TAC AGC AGC AGC GAG ACC CCT AAT OGC CAC GAC ATG CTC TTG GTG GAG COG 1254 

LNRLLQDKWDRFVKRIFYFN 438 

CTO AAC CGA CTC CTG CAG GAC AAG TGG GAC AGA TTC OTC AAG OGC ATC TTC TAC TTC AAC 1314 

FLVYCLYMI XFTMAAYYRPV 458 

TTC CTO GTC- TAC TOC CTG TAC ATG ATC ATC TTC ACC ATG OCT GCC TAC TAC AGG CCC GTG 1374 

DGLPPFKM8KIGOYFRVTOB 478 

GAT GGC TTG CCT CCC TTT AAG ATG GAA AAA ATT GGA GAC TAT TTC CGA GTT ACT GGA GAG 1434 

XL6VLGGVYFFFRG ZQYFLQ 498 

ATC CTG TCT GTG TTA GGA GGA GTC TAC TTC TTT TTC CGA GGG ATT CAG TAT TTC CTG CAG 1494 



FIGURE 1 (cont'd) 
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RRPSMKTL FVDSYSBMLFFL 518 
AGG CGG CCO TCG ATG AAG ACC CTC TTT GTG GAC AGC TAC AGT GAG ATG CTT TTC TTT CTG 1554 

QSLFMLATVVLYFSHLKEYV 53B 
CAG TCA CTG TTC ATG CTG GCC ACC GTG GTG CTG TAC TTC AGC CAC CTC AAG GAG TAT GTG 1614 

AS M V FSLA LGWTNMLYYTRG £58 
GCT TCC ATG GTA TTC TCC CTG GCC TTG GGC TGG ACC AAC ATG CTC TAC TAC ACC CGC GGT 1674 

FQQHGIYAVM IBKMILRDLC 578 
TTC CAG CAG ATG GGC ATC TAT GCC GTC ATG ATA GAG AAG ATG ATC CTG AGA GAC CTG TGC 1734 

RFMFVYIV FLFGFSTAVVTL 598 
CGT TTC ATG TTT GTC TAC ATC GTC TTC TTG TTC GGG TTT TCC ACA GCG GTG GTG ACG CTG 1794 

I E DG KNDS LPSESTSHRWRG 618 
ATT GAA GAC GGG AAG AAT GAC TCC CTG CCG TCT GAG TCC ACG TCG CAC AGG TGG CGG GGG 1854 

P A C R P PDSSYNSLYSTCLEL 636 
CCT GCC TGC AGG CCC CCC GAT AGC TCC TAC AAC AGC CTG TAC TCC ACC TGC CTG GAG CTG 1914 

FKFTIGMGDLB FTENYDFK A 656 
TTC AAG TTC ACC ATC GGC ATG GGC GAC CTG GAG TTC ACT GAG AAC TAT GAC TTC AAG GCT 1974 

VFZ ILLLAYVILTYILLLKM 678 
GTC TTC ATC ATC CTG CTG CTG GCC TAT GTA ATT CTC ACC TAC ATC CTC CTG CTC AAC ATG 2034 

LIALMGETVNKIAQESKNIW 698 
CTC ATC GCC CTC ATG GGT GAG ACT GTC AAC AAG ATC GCA CAG GAG AGC AAG AAC ATC TGG 2094 

XLQR AXTI LDTEKSFLKCMR 718 
AAG CTG CAG AGA GCC ATC ACC ATC CTG GAC ACG GAG AAG AGC TTC CTT AAG TGC ATG AGG 2154 

KAFRSGKLLQVGYTPDGKDD 738 
AAG GCC TTC CGC TCA GGC AAG CTG CTG CAG GTG GGG TAC ACA CCT GAT GGC AAG GAC GAC 2214 

YR WCFRVDEVNHTT1THTHVG 758 
TAC CGG TGG TGC TTC AGG GTG GAC GAG GTG AAC TGG ACC ACC TGG AAC ACC AAC GTG GGC 2274 

lIHBDPOUC EGVKRTLSFfiL 778 
ATC ATC AAC GAA GAC COG GGC AAC TGT GAG GGC GTC AAG CGC ACC CTG AGC TTC TCC CTG 2334 

RS8RV6GRHWKKFALVPLLR 796 
CGG TCA AGC AGA GTT TCA GGC AGA CAC TGG AAG AAC ITT GCC CTG GTC CCC CTT TTA AGA 2394 

BA8ARDRQSAQPEBVYLRQF 816 
GAG GCA AGT OCT CGA GAT AGG CAG TCT GCT CAG CCC GAG GAA GTT TAT CTG CGA CAG TTT 2454 

SG8 LKPBDAEVFKCPAASGE 838 
TCA GGG TCT CTG AAG CCA GAG GAC GCT GAG GTC TTC AAG AGT CCT GCC GCT TCC GGG GAG 2514 

K ♦ 840 

AAG TGA 2520 
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Full-length human VR2 

Input file Flh21ell.»eq; Output File Flh21ell.tr* 
St-guence length 2809 

O0CTAOOCTOIOcra * c W«JaMAGTO 

MAaACCaQAACCTOaiUtWUAQCrTA^ 

OOCAGCCCCTCCOGOCTTCACTTCC^^ 

_ • KTSPSSSP 

ttraCAGAGGTCCTGGCTOaACCGAGCAGC^^ ATO ACC TCA CCC TCC AGC TCT CCA 



e 

24 



VFRLBTLDGGQBDGSBADRG 2 6 
art TTC AOG TTO GAG ACA TTA GAT ©OA GGC CAA GAA GAT OGC TCT GAG GOG GAC AGA GGA 84 

KLDPGSGLPPMESOPQGfiDR 46 

AAG CTG GAT TTT GGG AGC GGG CTG CCT CCC ATG GAG TCA CAG TTC CAG GGC GAG GAC COG 144 

KFA PQIRVNLKVRKGTGAS 0 66 

AAA TTC GCC CCT CAG ATA AGA GTC AAC CTC AAC TAC CGA AAG GGA ACA GOT GCC ACT CAG 204 

PDFHRFDRDRLPKAVSR GVP fie 

COG GAT OCA AAC CGA TTT GAC OGA OAT CGG CTC TTC AAT GOG GTC TOC COG GOT GTC CCC 264 

BDX.AGLPEYX.SK TSKTLTDS 108 

GAG GAT CTG OCT GGA CTT CCA GAG TAC CTG AGC AAG ACC AGC AAG TAC CTC ACC GAC TOG 324 

ByTBGCTGKTCX.HCAVI.KXiK 128 

GAA TAC ACA GAG GGC TCC ACA GGT.AAfl AOG TOC CTG ATG AAG OCT GTG CTG AAC CTT AAG 384 

D G-VKACXL FItLQZ DRDfiGttp 148 

GAC GGA GTC AAT GCC TGC ATT CTG OCA CTG CTG CAG ATC GAC AOG GAC TCT GGC AAT CCT 444 

0 P 1* V K A 0 C T V DY T R G*H 8 A L H 168 

CAG CCC CTG GTA AAT GCC CAG TOC ACA GAT GAC TAT TAC OGA GGC CAC AGC OCT CTG CAC 504 

IAXBKRSLQCVKLLVB HGAH 168 

ATC GCC ATT GAG AAG AGG ACT CTG CAG TOT GTG AAG CTC CTG GTG GAG AAT GGG GCC AAT 564 

▼ HARAC_GR P FQKGQGTCFYF 208 

OTO CAT OCC COG GCC TOC OGC OGC TTC TTC CAG AAG OGC CAA GGG ACT TGC TTT TAT TTC 624 

OKl#FLfiX.AACT'XO**DVVSTL 228 

GOT GAG CTA CCC CTC TCT TTO GCC OCT TOC ACC AAG CAG TOO OAT OTO OXA AGC TAC CTC €84 ' 

fcBKPBOFAGLQA.TDBQGNT V 248 

CTG GAG AAC CCA CAC CAG OCC GCC. AGC CTG CAG OCC ACT GAC TOC GAG OGC AAC ACA GTC 744 

LHA LVKX£D«6 ABKIALVT£ 268 

CTG CAT GCC CTA OTO ATG ATC TCO GAC AAC TCA OCT GAG AAC ATT OCA CTG GTG ACC AGC 804 

M TDGLLQAOARtCPTVQLBD 288 

ATO W GAT O0O CTC CTC CAA OCT G9G OCC CGC CTC TOC CCT ACC GTG CAG CTT OAG GAC 844 

XRRLQDLTFLCLAACBGKIB 308 
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ATCCWrAACCTOCTOOATCTCACOCCTCTOAAOCTOCKXraCCAIW 924 

ATrTTCAGCCACATCCTOCAGCOO^TTTTCAOaACTOAOCCACOT »a« 

TEWCTOPVR .VSI.TO LA8VDS 348 

ACC OAO TOO TOC TAT OOO CCT OTC COO CTO TOO CTO TAT SAC CTO OCT TCT OTO OAC AOC 1044 

CKBM8V |.BZIArBCK8PHRH J6S 

TOT OAO GAG AAC TCA OTO CTO OAC ATC ATT OCC TTT CAT TOC AAC AOC COO CAC OQA CAC 1104 

RHVV LEPI.H!CI.I.OAK««>l*'I 386 

OOA ATC OTC CTT TTO OAO CCC CTO AAC AAA CTO CTO CAO OCO AAA TOG OAT CTO CTC ATC 1164 

CCC AAG TTC TTC TTA AAC TTC CTO TOT AAT CTG ATC TAC ATO TTC ATC TTC ACC OCT CTT 1224 

AYHOPTLKKOAAPHLKAFVO 428 

CCC tac CAT CAG CCT ACC CTG AAG AAG CAG -GCC OCC CCT CAC CTO AAA GOG OAG GTT OGA 1284 

HSMLLTOHII.Il.l'0<»I*»' I ' V 448 

AAC TCC ATG CTG CTO ACO OOC CAC ATC CTT ATC CTG CTA OOO OOO ATC TAC CTC CTC OTO 1344 

OOLKTP«P.RHVFI«ISPIBB 468 

GOC CAG CTO TOO TAC TTC TOG COG CGC CAC GTG TTC ATC TOO ATC TOO TTC. ATA OAC AOC 1404 

YFBXLFI.rOAI.I' T,rV eOVLC 488 

TAC TTT GAA ATC CTC TTC CTO TTC CAG OCC CTC CTC ACA CTO OTO TCC CAO OTO CTO TOT 1464 

TTCCTCG^ArCC^TOGTACCrGCCCCTC «*« 

■ i.I.T*TROr0BTOir6VMI0 S28 

AACCTOCTttL^ALe^OOCTTCCAflCACI^OOCAW X584 

AAG OTC ATC CTO COO OAC CTO CTO CGC TTC CTT CTO ATC TAC TTA OTC TTC CTT TTC OOC l^ 

ttcoctc^c^c^gtoaocctcaL^ 

CCCAATGC^ACAC^TCAO^CAOCCCA^ 

caotIcaLootaxc™oL<^t^ "« 

oLcTOCCCTTCcLo^CAOCTCCACTTCCOCO^ 
JcOTCC^ACCtLaXCCTOCTCCTC^ 

„„„„ m _ D8l rBi«Ki.Qr* i8V tea 

™&™m&to**wr™™™^™ m ™ mmMmim 2004 

TVO T« »»0«»» B «" CF,lV " B 

y\t?TWL 2 (ooottoiad) 
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Partial human VR2 alternate form 

Input file frhobl2c4.eeq; Output File frhabl2c4.tr* 
Sequence length 1469 

GRFF QKQQGTCPYF08X/FL 19 
OC GOC OOC TTC TIC CM AM OOC CAA GOG ACT TOC TTT TAT TTC GOT GAG CTA OCC CTC 57 

SLAACTKOtrDVVSYLLBNFti 39 
TCT TTG OCC CCT TOC ACC AAG CA0 TOO OAT CTO OTA AOC TAC CTC CTO GAG AAC CCA CAC 117 

0FASLQATO80ONTVLHALV 59 
CAO CCC OCC AOC CTG CAS OCC ACT QAC TCC CAG OGC AAC ACA OTC CTO CAT GCC CTA OTG 177 

H I C D K fi A B N IALVTSNTDCI, 75 
ATG ATC TOG GAC AAC TCA CCT GAG AAC ATT CCA CTC GTG ACC AOC ATG TAT OAT GOG CTC 237 

LQAOARLCFTVOLBOZRlf LO 99 

CTC CAA OCT GGG GCC OOC CTC TOC CCT ACC GTG CAG CTT GAG GAC ATC CGC AAC CTC CAG 297 

DLTPLKLAAX.BGKZBIFRHZ 119 
GAT CTC ACC CCT CTO AAG CTG GCC GCC AAO GAG GOC AAG ATC GAG ATT TTC AGO CAC ATC 357 

LQRBFSGLSHLSRXFTBWCy 139 
CTC CAO OCO GAG TXT TCA OGA CTG AGC CAC CTT TCC COA AAG TTC ACC GAG TOO TOC TAT 417 

OPVRV8LY DL A SVD8CBEBS 1S9 
OOG CCT CTC OOG GTG TOD CTO TAT QAC CTO OCT TCT GTG GAC AOC TOT GAG GAG AAC TCA 477 

VLBIIAFHCKSPHRHRMVVL 179 
OTG CTO OAO ATC ATT GOC TTT CAT TOC AAO AGC COG CAC OGA CAC COA ATG OTC OTT TTG 537 

E F XiBKIiL Q . A KHD L L 2 P KFF L 199 
OAO CCC CTG AAC AAA CTO CTO CAO GOO AAA TOO GAT CTC CTC ATC CCC AAG TTC TTC TEA 597 

HFLCNLXYMFIFTAVAYHQP 219 
AAC TTC CTO TOT AAT CTO ATC TAC ATG TTC ATC TTC ACC OCT OTT OOC TAC CAT CAO OCT 657 

TL .KKQAAP BliKABVOBSMLL 239 
AOC CTO AAG AAG CAO GOC OOC OCT CAC CTO AAA GOO OAO OTT OGA AAC TOC ATG CTO CTO 71? 

TGBIX.ILI1OOIYLLVGOLWY 259 
AOC OOC CAC ATC CTT ATC CTO CTA OOG 000 ATC TAC CTC CTC OTG OOC 00 CTO TOO TAC 777 

~P~ ff R R B V F I B X • « W X D 8 # Y F 8 X « - 279~ 
TTO TOO COO OOC CAC 0X0 TTC ATC TOO ATC TOO TTC ATA GAti AOC* TAC TTT GAA ATC CTC 837 

FLFQALLTVVBQV LCFL A I E 299 

TIC CTO TIC CAO OOC CTO CTC ACA OTG OTG TOC CAO OTO CTO TOT TTC CTO OOC ATC GAG 897 

ffYLPLL VlALV lGNLHLLYY 319* 
TOO TAC CTO CCC CTG CTT OTO TCT GOO CTO OTO CTO OOC TOO CTO AAC CTO CTT TAC TAT 9S7 

T R O FQBTO X T6 T M X.QK K A X fi 339 
ACA GOT OOC TTC GAG CAC ACA OOC ATC TAC AOT OTC ATO ATC CM AAO AAA OOC ATC TCT X01T 

V LEHEBOTBWCRKCQRAOVH 359 

OTC CTO OAO ATO OAO AAT OOC TAT TOO TOO TOC AGO AAO AAO CM 000 OCA GOT OTO ATO 1077 



FTGUPE 3 * 



WO 00/29577 



PCT/US99/2670! 



9/35 



•I 



=1 



"is 

u 



-I 



H 

O 
> 
H 
»3 



6 



B -I 



-S 



-I 



O 



•B 



6 *B 



•I 



-B 
►8 



■s 



6 *S °g 



I 

B § 



6 '8 



o 

-B 
-S 

•I 

W ^ 
cog 

< 



•B 



5 



WO 00/29577 



PCT/US99/26701 



10/35 



Partial rat VR2 

Input file FlrxblOgll.aeq; Output Pile Plrxbl47gli.tr* 
Sequence length 1794 

STHASALSLAACTKOWDVV 19 

C TCO ACC CAC OCO TCC OCT CTT TCT CTG OCT OCC TOC ACC AAG CAG TOO OAT OTO OTG 57 

TTLLEWPHCPAfiLBATDSLG 39 

ACC TAC CTC CTO GAC AAC CCA CAC CAG CCG OCC AOC CTO OAO OCC ACC OAC TCC CTC OOC H? 

KTVLHALVHIADHSPBHSAL 59 

AAC ACA OTC CTO CAT OCT CTO OTA ATG ATT OCA CAT AAC TCO CCT GAG AAC ACT OCC CTO 177 

VIHMYDg'lLQHGARLCPTVQ 79 

CTO ATC CAC ATO TAC OAC GOG CTT CTA CAA ATG OCO OCO COC CTC TOC CCC ACT OTG CAC 237 

LBEXSHHOGLTPLKLAAKBC 99 

CTT GAG GAA ATC TCC AAC CAC CAA GGC CTC ACA CCC CTG AAA CTA OCC OCC AAG GAA GGC 297 

KIBIPR H IL0 RBPSGPYQPL 119 

AAA ATC GAC ATT TTC AGO CAC ATT CTG CAO COG GAA TTC TCA CGA CCC TAC CAC CCC CTT 357 

SRKPTEWCTOPVRVfiLYDLfi 139 

TCC CGA AAG TTT ACT GAG TGG TOT TAC GOT CCT OTO CGO CTA TCG CTO TAC GAC CTO TCC 417 

SVDSHBK HSVLKIIAPHCKfi 159 

TCT OTG GAC AGC TGG GAA AAG AAC TCO GTO CTO GAO ATC ATC OCT TTT CAT TOC AAC ACC 477 

pHRHRHVVLEPltHKLLOBKW 179 

CCQ AAC COO GAC CSC ATO OTC CTT TTA GAA CCA CTG AAC AAG CTT CTO CAG GAG AAA TOO 537 

OR LVSRFFFKFACT1.VTMPI 199 

GAT COG CTC CTC TCA AGA TTC TTC TTC AAC TTC GCC TOC TAC TTO OTC TAC ATG TTC ATC S97 

FTVVAYHQPSLDQFAXPSfiK 219 

TTC ACC OTC GTT GCC TAC CAC CAG CCT TCC CTG GAT CAO CCA OCC ATC CCC TCA TCA AAA 6S7 

KTTQBBUhVhOBt* I**OQl 239 

GCG ACT TTT 000 GAA TCC ATO CTO CTG CTG GGC CAC ATT CTO ATC CTO CTT 000 GOT ATT 717 

YLLLO01-WTPWRRai'*I* IC 2 " 

TAC CTC TTA CTO OOC CAG CTC TGG TAC TTT TOO COO COO COC CTO TTT ATC TOO ATC TCA 777 

TH~vB m -r'-*-n ************* * 79 

TTC ATO OAC AOC TAC TTT CAA ATC CTC TTT CTC CTT CAO OCT CTO CTC ACA OTO CTO TOC Ml 

Qv***um***r* ************ 

<^OTOCIOCOCTTCjaOOWJU^OAA7»lM CTA / CCC CTO CTA OTO TTA TCC CTA OTO M7 

LOWLKLfcYYTROrOHTOIYfi "f 

CTOOOCTOOCTOJu^CTOCrTTACTA^ ,S7 

VMIQKV ILRDLLRPLLVTLV 339 

CnCJOTATCCMAMOTCATCCTTCaAOACCrOCrCCOTW xo« 

Tls*Q*AV**V**** BK *** % * B9 

TTC CTT TTC OOC TTT OCT OTA OCC CTA OTA AOC TTO AOC AGA GAO OOC CGA AOT CCC AAA 1077 
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GAP of: humanvr2.pep check: 5746 from: 1 to: 764 
humanVR2 Flh21ell 

to: humanvrl.pep check: 6877 from: 1 to: 839 
humanVRl _Fbhl8547pat - fchrb87a6, 3909 bases, 4554 checksum. 

Symbol comparison table: 
/ddm_local /gcg/gcg_9 . 1 /gcgcore /da t a /runda ta/blosum62 . cmp 
CompCheck: 6430 

Gap Weight: 12 Average Match: 2.912 

Length Weight: 4 Average Mismatch: -2.003 

Quality: 1530 Length: 850 

Ratio: 2.003 Gaps: 10 

Percent Similarity: 55.378 Percent Identity: 46.348 

Match display thresholds for the alignment (s) : 
| - IDENTITY 
: - 2 
• - 1 

human vr 2. pep x humanvrl.pep 

, # " ] MTSPSSSPVF 10 

1 MKKWSSTDLGTAADPLQKDTCPDPLDGDPNSRPPPAKPQLPTAKSRTRLF 50 

11 RLETLDGGOEDGSEIADRGKLDFGSGLPPMESQFQGEDRKFAPQIRVNLNY 60 

: : l.ll s • 8 1 

51 GKGDSEEAFPVDCPHEEGELDSCPTI . TVSPVITIQRPGDGPTGARLLSQ 99 

• * * * 

61 RKGT6ASQPDPNRFDRDRLFNAVSRGVPEDIAGLPEYLSKTSKYLTDSBY 110 

..: :|| :l II.. :ll I =1 HMI.lt 

100 DSVAASTEKTLRLYDRRSIFEAVAQNNCQDLESLLLFLQKSKKHLTDNEF 149 

* • • • 

111 TEU3STGKTCLMKAVLNlJ<DGVNACILPLLQIDRDSGNPQPIiVNAQCTDDy 160 

: III I II: I l.ll I II I I Ihl I ... HII III 
150 KDPETGKTC^KAMLNLHIXXJNTTIPLLLEIARQTDSLKELVKASYTDSY 199 

161 YRGHSALHIMEKRSLQCVKliLVENGAHVHftRACGRFFQKGQG . TCF5TFG 209 

1:1 .1111111:1.: I II 1 1 1 1 1 • I I I • H.I •« MM 
200 YKGOTAI^IAIERRNMALVTLLVENGADVQAAAHGDrFKKTKGRPGFYFG 249 

. 

210 ELPLSLAACTKQWDWSYXiENPHQPASLQATDSQGNTVLHALVMISDNS 259 

1 1 II 1 1 1 1 1 1 | :l :l|:l I I * I II MHIIIII t.ll. 

250 ELPI^I^CTNQWIVKFLWNSWQTADISARDSVGNTVLHALVEVADNT 299 

260 AEKIALVTSMYIXnJiQAGARlX:PTVQIXDIRNLQDLTPIJaARKEGKIEI 309 

|:| Hill. :l M. .M«i I • 'III. Ill Ml : 

300 ADOTKFVTSMYNEIljHLGAIOiHPTLiajEELTHKKGMTPLAIAAGTGKIGV 349 



FIGURE 5 



WO 00/29577 



PCT/US99/26701 



13/35 

310 FRHILQREFS..GLSHLSRKFTEWCyGPVRVSLYDLASVDSCEENSVLEI 357 

:||||| HII I II II III) I M II- :l-ll-lllll: 

350 1AYILQREIQEPECRHLSRKFTEWAYGPVHSS1.YDLSCI DTCEKNSVLEV 



399 



358 I AF . HCKSPHRHRMVVLEPLNKLLQAKWDLLI PK . FFLN FLCNLI YMFI F 405 

||: ..|. I I I... II II: IN ill : : Is HI : " 11 jn 
400 IAYSSSETPNRHDMLLVEPLNRLLQDKWDRFVKRIFYFNFLVYCLYMIIF 449 

. 

406 TAVAYHQPTLKKQAAPHLKAEVGNSMLLTGHILILLGGIYLLVGQLHYFW 455 

| ||:. | | .:|. .1111 .111:1 : N 

450 TMAAY YRPV . . DGLPPFKMEKI GDYFRVTGEI LSVLGGVYFFFRGIQYF1* 497 

456 rrhvfiwisfidsyfeilflfqalltwsqvu:fiaiewylpllvsalvl 505 

.1 . 1:111 I. II I.I -.HI '• 5 ' I 

498 QRRPSMKTLFVDSYSEMLFEXQSLFMLATWLYFSHLKEYVASMVFSLAL 547 

506 GWLNLLYYTRGFQHTGIYSVMIQKVILRDLLRFLLIYLVFLFGFAVALVS 555 

|| |:||llllll I II. I 11:1.1 II I I II: : I : I I I I I I • M- 
548 GWTNMLYYTRGFQQMG I YAVMI EKMILRDLCRFMFVYI VFLFGFSTAWT 597 

. • • 

556 LSQEAWRPEAPTG PNATES VQPMEGQEDEGNGAQYRG I LEASLELFKFT I 605 

... | . . | | . I : 1 1 1 I 1 1 II 
598 LIEDGKNDSLPSESTSHRWRGPACRPPD SSYNSX-YSTCLELFKFTI 643 

606 GMGELAFQEQLHFRGMVLLi,UAYVLLTYil.LLNMLIALMSETVNSVATD 655 

| YlTll I I: - ::llllll:llll Mill MM *■ ' ^ 

644 GMGDLEFTENY0FKAVFIILLLAYVILTYILLLNMLIALMGETVNK1AQE 693 

656 SWSIWKLQKAISVIXMENGYWWC . PJO<QRAGVMLTVGTKPDGSPDERWCF 704 

I 11111:11. :l: I : III LI :l II HI Mill 
694 SKNI^RAITXI^TEKSFLKCMPJQ^ 743 

705 R\nSEVNWASWEQTl«PTI£EDPSGA.GVPRTLENPVLASPPKEDEDGASEE 753 

11:1111.1 • : Ml II IH • 1 
744 RVDEVNWTTWNTNVGIXNEDPGNCEGVKRTLSFSLRSS. . . .RVSGRHWK 789 

* 764 

754 NYVPVQLLQSH 

790 HFALVPLI^EASARDRQSAQPEEVYIJIQFSGSLKPEDAEVFKSPAASGEK 839 
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GAP of: humanvr2.seq check: 8853 from: 1 to: 2809 
humanVR2 21ella, 2809 bases, 8853 checksum. 

to: humanvrl.seq check: 4554 from: 1 to: 3909 

humanVRl Fbhl8547pat - Import - complete 

Symbol comparison table: . 
/ddm_local/gcg/gcg_9.1/gcgcore/data/rundata/nwsgapdna.cmp 

CompCheck: 8760 

Gap Weight: '50 Average Match: 10.000 

Length Weight: 3 Average Mismatch: 0.000 

Quality: i4359 Length: 3934 

Ratio: 5.112 Gaps: 15 

Percent Similarity: 55.316 Percent Identity: 55.316 

Match display thresholds for the alignment (s) : 
| - IDENTITY 
: « 5 
• - 1 

humajivr2 . seq x humanvrl.seq 



1 ' GGCTAGCCTGTCCTGACAGGGGAGAG 26 

I I I I I II lit 
801 TGTCCACAGTAGTCCCCCCTTATCCACGGGTGTCACTTTCCATGGGTTCA 850 

27 TTAAGCTCCCGTTCTCCACCGTGCCGGCTGGCCAGGTGGGCTGAGGGTGA 76 

i i i i 1 1 imih i i ii » i rtA 

851 GTTATTTGCGGTCAACCACGGTCTGCCAATATTAAATGGAAAATTCTTCA 900 

77 CCGAGAGACCAGAACCTGCTTGCTGGAGCTTAGTGCTCAGAGCTGGGGAG 126 

|| III I I I II I I M HI I 1 II 

901 AACAGTTCCCAAGTTTTCCCTTGTGCATTGTTCTGAGCAGTGTGATGAAG 950 

•• ' • • 

127 GGAGGTTCCGCCGCTCCTCTGCTGTCAGCGCCGGCAGCCCCTCXXX3GCTT 176 

| I I II I II I I III I I 

951 AGTCTCTGOCGTGCCATCTGGGATGCAAACCC3TCCCTGTGTCCCOCAOGT 1000 

177 CACTTCCTOCCGCAGCCCCTGCTACTGAGAAGCTCCGGGATCCCAGCAGC 226 

I II III M Ml Ml II I I I 

1001 CCAGGCCGTAGATGCTCCCCG CCGGTCAGTCACTTAGTCGTCAGATCGCC 1050 

• * • • * 

227 CGCCACGCCCTGGC .CTCAGCCTGCGGG 253 

II " III' I I 

1051 CGTCCTGGTATCACAGTGCTTCT6TTCAGGTTGCACACTGGGCCACAGAG 1100 
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254 GCTCCAGTCAGGCCAACACCGACGCGCAGCTGGGAGGAAG 293 

I Mill HI t I 11111 It I 

XI 01 GATCCAGCAAGGATGAAGAAATGGAGCAGCACAGACTTGGGGACAGCTGC 1150 
• • * • 

294 ACAGGACCCTTGACATCTCCATCTGCACAGAGGTCCTG 331 

I IIMI I II I II III I Ml III 
1151 GGACCCACTCCAAAAGGACACCTGCCCAGACCCCCTGGATGGAGACCCTA 1200 

332 GCTGGACCGAGCAGCCTCCTCCTCCTAGGATGACCTCACCCTCCAGC . . T 379 
II I I II I II II III II III 

1201 ACTCCAGGCCACCTCCAGCCAAGCCCCAGCTCCCCACGGCCAAGAGCCGC 1250 

380 CTCCAGTTTTCAGGTTGGAGACATTAGATGGAGGCCAAGAAGATGGCTCT 429 
1 1 I I I II If III IIMM I Mil 

1251 ACCCGGCTCTTTGGGAAGGGTGACTCGGAGGAGGCTTTCCCGGTGGATTG 1300 

4 30 G AGGCGG ACAG AGGAAAGC TG G AT TT TGGG AGCGGGCTGCCTCCC ATGG A 4 79 
I II I I I I i M I I I 

1301 CCCCCACGAGGAAGGTGAGTTGGACTCCTGCCCGACCATCACAGTCAGCC 1350 

480 GTCACAGTTCCAGGGCGAGGACCGGAAATTCGCCCCTCAGATAAGAGTCA 529 
I I Ml MM I III I Mil l I 
1351 CTGTTATCACCATCCAGAGGCCAGGAGACGGCCCCACCGGTGCCAGG. .C 1398 

530 ACCTCAACTACCGAAAGGGAACAGGTGCCAGTCAGCCGGATCCAAACCGA 579 
II I I 111 II III III 

1399 TGCTGTCCCAGGACTCTGTCGCCGCCAGCACCGAGAAGACCCTCAGGCTC 1446 



580 TTTGACC^ GA ^ f M || | I I | | | |f|| 

1449 TATGATCGCAGGAGTATCTTTGAAGCCGTTGCTCAGAATAACTGCCAGGA 1498 

630 TCTGGCTGGACTTCCAGAGTACCTGAGCAAGACCAGCAAGTACCTCACCG 679 
IIMI I If I I MM MM II Ml IMMM I 

1499 TCTGGAGAGCCTGCTGCTCTTCCTGCAGAAGAGCAAGAAGCACCTCACAG 1548 



680 ACTCGGAATAGACAGAGGGCTCCACAGGTAAGACGTGCCTGATGAAGGC7 729 
M II I II Ml Mill Mill II III MM M 

1549 ACAACGAGTTCAAAGACCCTGAGACAGGGAAGACCTGTCTGCTGAAAGCC 1598 



. . 730 GTGCTGAACCTTAAGGACGGAGTCAATGCC7GCATTCTGCCACTGCTGCA 779 
MM IMM I IIMM M II Ml « I II III I 
1599 ATGCTCAACCTGCACGACGGACAGAACACCACCATCCCCCTGCTCCTGGA 1648 
• • • • • 

760 GATCGACAGGGACTCTGGCAATCCTCAGCCCCTGGTAAATGCCCA6TGCA 629 
Mill II I I I M I II II 1 1 II Ml I II 
164 9 GAT06CGCGGCAAACGGACAGCCTGAAGGAGCTTGTCAACGCCAGCTACA 1698 

830 CAGATGACTATTACCGAGGCCACAGCGCTCTGCACATCGCCATTGAGAAG 879 
I M III Ml Mill I M IMIIIIIIIIIII MM 
1699 CGGACAGCTACTACAAGGGCCAGACAGCACTGCACATOGCCATCGAGAGA .1748 
m * • • • 

880 AGGAGTCTGCAGTGTGTGAAGCTCCTGGTGGAGAATGGGGCCAATGTGCA 929 
I | || MM MIMMMIIMI II II I II II 

1749 CGCAACATGGCCCTGGTGACCCTCCTGGTGGAGAACGGAGCAGACGTCCA 1798 

930 TGCCCGGGCCTGCGGCCGCTTCTTCCAGAAGGGCCAAG, . .GGACTTGCT 976 
|| (IN || MUM MM I III II 11 I I 
1799 GGCTGCGGCCCATGGGGACTTCTTTAA6AAAACCAAAGGGCGGCCTGGAT 1848 

FIGURE 6 (cont'd) 
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. . • • • 

977 TTTATTTCGGTGAGCTACCCCTCTCTTTGGCCGCTTGCACCAAGCAGTGG 1026 
I || llllllll II IMM II lllllll MIIMII III I 
184 9 TCTACTTCGGtGAACTGCCCCTGTCCCTGGCCGCGTGCACCAACCAGCTG 1898 

1027 GATGTGGTAAGCTACCTCCTGGAGAACCCACACCAGCCCGCCAGCCTGCA 1076 

I I II I I III III IMM I III I III II 
1899 GGCATCGTGAAGTTCCTGCTGCAG AACTCCTGGCAGACGGCCGACATCAG 1948 

• • ♦ * 

1077 GGCCACTGACTCCCAGGGCAACACAGTCCTGCATGCCCTAGTGATGATCT 1126 

(III Kill .IMIIIIM II Hill Mill Ml I I 
194 9 CGCCAGGGACTCGGTGGGCAACACGGTGCTGCACGCCCTGGTGGAGGTGG 1998 

* • • • 

1127 CGGACAACTCAGCTGAGAACATTGCACTGGTGACCAGCATGTATGATGGG 1176 

I HUM I II It (III I IMM M I M I II III I 

1999 CCGACAACACGGCCGACAACACGAAGTTTGTGACGAGCATGTACAATGAG 2048 

1177 CTCCTCCAAGCTGGGGCCCGCCTCTGCCCTACCGTGCAGCTTGAGGACAT 1226 

III MMM M III II II MM Mill I 

204 9 ATTCTGATGCTGGGGGCCAAACTGCACCCGACGCTGAAGCTGGAGGAGCT 2098 

1227 CCGCAACCTGCAGGATCTCACGCCTCTGAAGCTGGCCGCCAAGGAGGGCA 1276 

I MM I Ml I Mill Ml Mill 1 1 I Ml 
2099 CACCAACAAGAAGGGAATGACGCCGCTGGCTCTGGCAGCTGGGACCGGGA 2148 

1277 AGATCGAGATTTTCAGGCACATCCTGCAGCGGGAGTT TTCAGGA 1320 

II II It I I M IMM I 1 1 1 I 1 1 1 1 I I I 

214 9 AGATCGGGGTCTTGGCCTATATTCTCCAGCGGGAGATCCAGGAGCCCGAG 2198 

1321 CTGAGCCACCTTTCCCGAAAGTTCACCGAGTGGTGCTATGGGCCTGTCCG 1370 

II Mill III I IMIMMMIMM III Mill II I 
2199 TGCAGGCACCTGTCCAGGAAGTTCACCGAGTGGGCCTACGGGCCCGTGCA 2248 

1371 GGTGTCGCTGTATGACCTGGCTTCTGTGGACAGCTGTGAGGAGAACTCAG 1420 

MIIMII MMM I I I MM III Ml I I I M 1 1 I 
224 9 CTCCTCGCTGTACGACCTGTCCTGCATCGACACCTGCGAGAAGAACTCGG 2298 

• • • . • 

1421 TGCTGGAGATCATTGCCTTTCATTGCA • . .AGAGCCCGCACCGACACCGA 1467 

MIIMII I II MM Ml Ml Ml I M Ml 

2299 TGCTGGAGGTGATCGCCTACAGCAGCAGCGAGACCCCTAATCGCCACGAC 2348 

♦ • • 

1468 ATGGTCGTTTTGGAGCCCCTGAACAAACTGCTGCAGGCGAAATGGGA. • ♦ 1514 

Ml II I lllllll MMM Ml MIMM II Mill 
2349 ATGCTCTTGGTGGAGCCGCTGAACdGACTCCTGCAGGACAAGTGGGACAG 2398 

1515 TCTGCTCATCCCCAAGTrCTTCTTAAACTTCCTGTGTAATCTGATCTACA 1564 

| Ml I II MM Ml IMIIIIM I I MM 
2399 ATTOGTCAAGCGCATCTTCTACTTCAACTTCCTGGTCTACTGCCTGTACA 2448 

• • • 

1565- TGTTCATCTTCACOGCTGTTGCCTACCATCAGCCTACCCTGAAGAAGCAG 1614 

II MMMMIM I lllllll f I Ml II 

2449 TGATCATCTTCACCATGGCTGCCTACTA CAGGCCCGTGGATGGCTT 2494 

• • • • 

1615 GCCGCCCXTCACCTGAAAGCGGAGGTTGGAAACTCCATGCTGCTGACGGG 1664 

III Ml I I M I I (MM Ml II I M II 

24 95 GCCTCCCTTTA . . AGATGGAAAAAATTGGAGACTATTTCCGAGTTACTGG 2542 
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1665 CCACATCCTTATCCTGCTAGGGGGGATCTACCTCCTCGTGGGCCAGCTGT 1714 

I UNI II If If If Mill III t I || 
2543 AGAGATCCTGtCTGTGTTAGGAGGAGTCTACTTCTTTTTCCGAGGGATTC 2592 

• • • 

1715 GGTACTTCTGGCGGCGCCACGTGTTCATCTGGATCTCGTTCATAGACAGC 1764 

Ml Ml II I I I II II III I I J I IMIII 
2593 AGTATTTCCTGCAGAGGCGGCCGTCGATGAAGACCCTGTTTGTGGACAGC 2642 

• • * 

1765 TACTTTGAAATCCTCTTCCTGTTCCAGGCCCTGCTCACAGTGGTGTCCCA 1814 

IM III II ||.. 1.1 I I I III I III III |(f || 
2643 TACAGTGAGATGCTTTTCTTTCTGCAGTCACTGTTCATGCTGGCCACCGT 2692 
* 

1815 GGTGCTGTGTTTCCTGGCCATCGAGTGGTACCTGCCCCTGCTTGTGTCTG 1864 

MINIM III I II || Ml Ml I M I 

2693 GGTGCTGTACTTCAGCCACCTCAAGGAGTATGTGGCTTCCATGGTATTCT 2742 

• • • . . 
1665 CGCTGGTGCTGGGCTGGCTGAACCTGCTTTACTATACACGTGGCTTCCAG 1914 

I MM NIMH! Ill MM Mill II II II IIMM 
2743 CCCTGGCCTTGGGCTGGACCAACATGCTCTACTACACCCGCGGTTTCCAG 2792 

• • • 

1915 CACACAGGCATCTACAGTGTCATGATCCAGAAGGTCATCCTGCGGGACCT 1964 

M I MIIMII IIIIIMI Mill I MUM | | If M 
2793 CAGATGGGCATCTATGCCGTCATGATAGAGAAGATGATCCTGAGAGACCT 2842 

• • • # « 
1965 GCTGCGCTTCCTTCTGATCTACTTAGTCTTCCTTTTCGGCTTCGCTGTAG 2014 

I M I I I I I I I I I I I II I I I I I I I II I M I || 
2843 GTGCCGTTTCATGTTTGTCTACATCGTCTTCTTGTTCGGGTTTTCCACAG 2892 

2015 CCCTGGTGAGCCTGAGCCAGGAGGCTTGGCGCCCCGAAGCTCCTACAGGC 2064 

I IMIII lilt I II I I I II I I 

2893 CGGTGGTGACGCTGATTGAAGACGGGAAGAATGACTCCCTGCCGTCTGAG 2942 

• * • • • * 
2065 CCCAATGCCACAGAGTCAGTGCAGCCCATGGAGGGACAGGAGGACGAGGG 2114 

Ml I III 1 I II I I II 

2943 TCCA CGTCGCACAGGTGGCGGGGGCCTGCCTGCAGGCC 2980 

2115 CAACGGGGCCCAGTACAGGGGTATCCTGGAAGCCTCCTTGGAGCTCTTCA 2164 

I II I Nil II Ml I MMMI MM 

2981 CCCCGATAGCTCCTACAACAGCCTGTACTCCACCTGCCTGGAGCTGTTCA 3030 

• • • • * 
2165 AATTCACCATCX5GCATGGGCGAGCTGGCCTTCCAGGAG 2214 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 I 1 1 1 1 III III I Mill 
3031 AGTTCACCATCGGCATGGGCGACCTGGAGTTCACTGAGAACTATGACTTC 3080 

• • • ♦ • 
2215 CGCGGCATGGTGCTGCTGCTGCTGCTGGCCTACGTGCTGCTCACCTACAT 2264 

I I I I I t 1 I I I I I I I I I I M II I If M M | ( | M 
3081 AAGGCTGTCTTCATCATCCTGCTGCTGGCCTATGTAATTCTCACCTACAT 3130 

2265* CCTGCTGCTCAACATGCTCATCGCCCTCATGAGCGA 2314 

III fll IMIII II M M IMIII If MM I Mill MMMI 
3131 CXTTCCTGCTCAACATGCTCATCGCCCT^ 3180 

• • « * . 
2315 TCGCCACTGACAGCTGG/IGCATCTGGAAGCTGCAGAAAGCCATCTCT6TC 2364 

mi it mi ii ii mi ii ii i ii i ii i iiinii i it 

3181 TCGCACAGGAGAGCAAGAACATCTGGAAGCTGCAGAGAGCCATCACCATC 3230 
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2365 CTGGAGATGGAGAATGGCTATTGGTGGTGCAGGAAGAAG . . . CAGCGGGC 2411 

Mill I INI I I III Mill II INI M l 

3231 CTGGACACGGAGAAGAGCTTCCTTAAGTGCATGAGGAAGGCCTTCCGCTC 3280 

2412 AGGTGTGATGCTGACCGTTGGCACTAAGCCAGATGGCAGCCCGGATGAGC 2461 

||f I Mill M II I M I I I I I 1 1 II I I 

3281 AGGCAAGCTGCTGCAGGTGGGGTACACACCTGATGGCAAGGACGACTACC 3330 

24 62 GCTGGTGCTTCAGGGTGGAGGAGGTGAACTGGGCTTCATGGGAGCAGACG 2511 

I M II II I I If I I J I M I INMMMIM I I III I I 
3331 GGTGGTGCTTCAGGGTGGACGAGGTGAACTGGACCACCTGGAACACCAAC 3380 

2512 CTGCCTACGCTGTGTGAGGACCCG. . .TCAGGGGCAGGTGTCCCTCGAAC 2558 

M | | II MINI I I M IM II II 

3381 GTGGGCATCATCAACGAAGACCCGGGCAACTGTGAGGGCGTCAAGCGCAC 34 30 

2559 TCTCGAGAACCCTGTCCTG GCTTCCCCTCCCAAGGAGGATGAGGAT 2604 

|| || III II M II II 

3431 CCTGAGCTTCTCCCTGCGGTCAAGCAGAGTTTCAGGCAGACACTGGAAGA 3480 

2605 GGTGCCTCTGAGGAAAACTATGTGCCCGTCCAGCTCCTCCAGTCCAACTG 2654 

| | I Ml II I IN II . II 

3481 ACTTTGCCCTGGTCCCCCTTTTAAGAGAGGCAAGTGCTCGAGATAGGCAG 3530 

2655 ATGGCCCAGATGCAGCAGGAGGCCAGAGGACAGAGCAGAGGATCTTTCCA 2704 

II Ml || I I Mill Ml Ml I I 

3531 TCTGCTCAGCCCGAGGAAGT^TATCTGCGACAGTTTTCAGGGTCTCTGAA 3580 

♦ 

2705 ACCACATCTGCTGGCTCTGGGGTCCCAGTGAATTCTGGTGGCAAATATAT 2754 

Ml I I I Ml Nil II I HI I 

3581 GCCA GAGGACGCTGAGGTCTTCAAGAGTCCTGCCGCTTCCGGGGA 3625 

2755 ATTTTCACTAACTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 2804 

I I I III 111 I Ml III 

3626 GAAGTGAGGACGTCACGCAGACAGCACTGTCAACACTGGGCCTTAGGAGA 3675 

2805 AAAAA 2809 

3676 CCCCGTTGCCACGGGGGGCTGCTGAGGGAACACCAGTGCTCTGTCAGCAG 3725 
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CLUSTAL W (1.74) multiple sequence alignment 



humanVR2 
ratVR2 



^PSSSPVFlUJm-DGGOEDGSEADRGI^ 



human VR2 
ratVR2 



humanVR2 
ratVR2 



huznanVR2 
ratVR2 



I^VENGANVHARACGRFI^KCWrcFyPGEl^I^I^ 

STHASAl^IAACTKQWDVVTYLLENPHQPASLEA 



humanVR2 
ratVR2 



TDSQGNTVIiHAL VM I SDNS AEN I ALVTSMYIX3IiIjOAGARI/rPTVQLED I RNLQDLTPLKL 
TDSl/JNTVLiHALVMIADNSPENSALV^ 



human VR2 
ratVR2 



AAKEGKIEIIiaiIIK)I^PSO-LSHl^WCFTEWCTGPW 

AAKEGKIEI P^IMREFSGPYQPLSRICPreWCTGPVRVSLYDljSSVDSWBKKSVLBI IA 



huaanVR2 
ratVR2 



raocsPHwmMvviapuncuw^ 

FHO^PNRHRMVVI^UTO^EJOTIU-VSR^ 



faumanVR2 
ratVR2 



APHUCAEVGKSKLLTGHILII^ 



humanVR2 
ratVR2 



human VR2 
ratVR2 



faumanVR2 
ratVR2 



I-TVVSQVW3^EKYIJ^VSALVLGWI^ 
l*TVLSCWWWETEinfI«XV^ 
***;*•*♦ #♦ *•••**•* . 

LrYLVFI^rAVALVSI^EAWIO>EAP7X3PKATKSV^ 

LVYLVFLFGRAV^VSLSRSARSPXILPSDW •APYKSXIjEASLE 
WKFTIGMmAPQBOWFRGVTO^^ 



bumanVR2 
ratVR2 



faumanVR2 
ratVR2 



PTLCgDPfiGgmPRTXiBHPVL^ 
PTIiSBDPSGPGXTGNKXNmK-MK-" 



FXGOKB 7 



WO 00/29577 



PCT/US99/26701 



20/35 

GAP of: ratvr2.pep check: 91>0 front 1 to: SS4 
ratVR2 Plrxbl47gll 

to: humanvrt.pep check: 5746 tram 1 to: 764 
faunanVR2 Flh21ell 

Syfcol W ari.on table: /« e r/loc.l/gcg. 9 .a/gcgcore/dat a /rund.t./blosu»6 2 .c-p 

CocnpCheck: €430 



Gap Height: 
Length Height: 



12 Average Match: 2.912 

4 Average Mismatch: -2,003 



Quality: 2182 Hg£ 76 J 

Percent SillSS! Percent Identity: 79.1.7 

Match display threaholds for the alignment (a) s 
| • IDENTITY 
: « 2 
. - 1 



ratvr2.pep x humanvr2 .pep 



1 STKASAl*!^^^ 

i mi mil • I MiiiiMiii « i J! J' 

mimiii ii ill 1 1 1 1 1 ■ 1 HUM II 1*1 • 1 ' mi1 



95 AAKEGKIE 



301 



350 



X9S 



I * 1 1 1 1 1 1 1 1 ilLLH 

EENSVLEX 



400 XTHFIFXAVAXHQ1 



245 



295 



145 Jw«Bmm«w?«H^ 



S50 « 



fmmfn I M h 1 1 IHH MM Ml I 

1 1 1 f lit I •n.JLJ L^L«^ 



44 

250 
94 

300 , 

144 

349 

194 

399 

244 

449 

294 

499 

344 

549 

392 

599 
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393 tFKFTIGMGHAroiW^GVVI^^ 442 

iiiniiiiiiitiiii in-HiiiiiHi m:i^miiiii 

600 LFKFTIOWEIATQEQl^FTl<»mJAI^WIATYILU^IAI^ETV «49 

443 imvADHSwsiHia«KAisvi£MBwmwc^iaa«EO «« 

493 DERKCFRVEEATJHAAWEKTIJ'TLSEDPSGPGITCWCKNPT . . . -SKPGKN 538 

539 SASEEOHLPLQVLQSP SS4 
749 QASEENYVPVQLLOSN 764 
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GAP of: humanvrl.seq check: 4554 from: 1 to: 3909 
humanVRl Fbhl8547pat - Import - complete 

to: ratvrl.seq check: 7921 from: 1 to: 2847 

ratVRl.seq AF029310 in GenBank 

Svrobol comparison table: 
/ddriocal/gcg/gcg_9.1/gcgcore/data/rundata/nwsgapdna.cmp 

CompCheck: 8760 

Gap Weight: ' 50 Average Match: 10.000 

Length Weight: 3 Average Mismatch: 0.000 

Quality: 22717 Length: 3914 

Ratio: 7.979 Gaps: 10 

Percent Similarity: 82.125 Percent Identity: 82.125 

Match display thresholds for the alignment (s) : 
| = IDENTITY 
: - 5 
. «=» 1 

humanvrl .eeq x ratvrl.seq • 



1001 CCAGGCCGTAGATGCTCCCCGCCGGTCA6TCACTTAGTCGTCAGATCGCC 10 

CAGCTCCAAGGCACTTGCTCC 21 



1051 COT—CACA^^ 

22 atttggggtgtgcctgc*^ 68 

1101 GATCCAGCAAGGATGAAGAAATGGAGCAG^CAGACTTWGGACAGCTGC 



1100 



1150 
1200 

Mill Mil II II III Mill I M I 1 1 I I I M 1 

119 GTC 



1151 f^ft^^T^ 

1 201 ACTCCAGt^CCTCCAGC^GCCC^CCCaCGG^^ "50 
1251 ACCCGGCTCTTTGGGAAGGGTGACTCGGAG€AGGCITTCCCGGTGGATTG 1300 

J351 mraMM»XKC»am^^ »»° 



FIGURE 9 



WO 00/29577 



PCTAJS99/26701 



23/35 

1401 CTGTCCCAGGACTCTGTCGCCGCCAGCACCGAGAAGACCCTCAGGCTCTA 1450 

til MINIM III MM I MMM Ml IMIIIM 
369 TCATCCCAGGACTCCGTCTCCGCTGG . . - TGAGAAGCCCCCGAGGCTCTA 415 

1451 TGATCGCAGGAGTATCTTTGAAGCCGTTGCTCAGAATAACTGCCAGGATC 1500 
I I f II M II M I Mill II II II IMMM M If It I M M I I 
416 TGATCGCAGGAGCATCTTCGATGCTGTGGCTCAGAGTAACTGCCAGGAGC 465 

1501 TGGAGAGCCTGCTGCTCTTCCTGCAGAAGAGCAAGAAGCACCTCACAGAC 1550 

MMMIIIMIMI IIMMIMM 1 1 I 1 1 I I I I 1 I Ml M Ml 
4 66 TGGAGAGCCTGCTGCCCTTCCTGCAGAGGAGCAAGAAGCGCCTGACTGAC 515 

1551 AACGAGTTCAAAGACCCTGAGACAGGGAAGACCTGTCTGCTGAAAGCCAT 1600 

I IMMMMMMM IMIIIM MMIIIMIMM IMIIIM 
516 AGCGAGTTCAAAGACCCAGAGACAGGAAAGACCTGTCTGCTAAAAGCCAT 565 

• • • • • 
1601 GCTCAACCTGCACGACGGACAGAACACCACCATCCCCCTGCTCCTGGAGA 1650 

MMM MMM I II Mill IMMM I I M II M II I I 
566 GCTCAATCTGCACAATGGGCAGAATGACACCATCGCTCTGCTCCTGGACG 615 

• • • • • 
1651 TCGCGCGGCAAACGGACAGCCTGAAGGAGCTTGTCAACGCCAGCTACACG 1700 

I I I Ml t II I I I I I I I J I I t I II IMMM HI IN 

616 TTGCCCGGAAGACAGACAGCCTGAAGCAGTTTGTCAATGCCAGCTACACA 665 

• • • • • 
1701 GACAGCTACTACAAGGGCCAGACAGCACTGCACATCGCCATCGAGAGACG 1750 

II IMMM I I I I M M I I I I MM I I I MM II I Mill II I M 

666 GACAGCTACTACAAGGGCCAGACAGCACTGCACATTGCCATTGAACGGCG 715 

• • • • • 
1751 CAACATGGCCCTGGTGACCCTCCTGGTGGAGAACGGAGCAGACGTCCAGG 1800 

MMM I MIMMIMM IIMMMM IMIIIM IMMM 
716 GAACATGACGCTGGTGACCCTCTTGGTGGAGAATGGAGCAGATGTCCAGG 765 

1801 CTGCGGCCCATGGGGACTTCTTTAAGAAAACCAAAGGGCGGCCTGGATTC 1850 
IMMM I IIMMIMM IMMMMMMM IMMM Ml 
766 CTGCGGCTAACGGGGACTTCTTCAAGAAAACCAAAGGGAGGCCTGGCTTC 815 
. * • • • 

1851 TACTTCGGTGAACTGCCCCTGTCCCTGGCCGCGTGCACCAACCAGCTGGG 1900 
Mill Mill 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 Mill III III III Mill 
816 TACTTTGGTGAGCTGCCCCTGTCCCTGGCTGCGTGCACCAACCAGCTGGC 865 

• • • • • 
1901 CATCGTGAAGTTCCTGCTGCAGAACTCCTGGCAGACGGCCGACATCAGCG 1950 

III II MMM MM IMMMMMIMIMI I II f 1 1 1 1 1 1 1 It 

866 GATTGTGAAGTTCCTGCTGCAGAACTCCTGGCAGCCTGCAGAGATCAGCG 915 

• • • • • 
1951 CCAGGGACTCGGTGGGCAACACGGTGCTGCACGCCCTGGTGGAGGTGGCC 2000 

II 1 1 M M I I f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 MMMMIIMMIM 
916 CCCGGGACTCAGTGGGCAACACGCTGCTTCATGCCCTGGTGGAGGTGGCA 965 

• • • • • 
2001 GACAACACGGCCGACAACACX»AAGTTTGTGACGAGCATGTACAATGAGAT 2050 

II (MM I IMIIIM Mill Mill I II INN Ml II Ml 
966 GATAACACAGTTGACAACACCAAGTTCGTGACAAGCATGTACAACGAGAT 1015 

• • • • • 
2051 TCTGATGCTGGGGGCCAAACTGCACOCGAOGCTGAAGCTGGAGGAGCTCA 2100 

1 1 1 1 ! 1 1 I M 1 1 II t f 1 1 Mill MMIIIMIMM Ml Ml 
1016 CTTGATCCTGGGGGCCAAACTCCACCCCACX5CTGAAGCTGGAAGAGATCA 1065 
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2101 CCAACMGAAGGGAATGACGCCGCTGGCTCTGGCAGCTGGGACCGGGAAG 2150 
Mlllt MINI I Mill IIMHIIMI III t I MHII 



1066 



CCAACAGGAAGGGGCTCACGCCACTGGCTCTGGCTGCTAGCAGTGGGAAG 



1115 



2251 
1216 
2301 
1266 



2151 ATCGGGGTCTTGGCCTATATTCTCCAGCGGGAGATCCAGGAGCCCGAGTG 2200 

I 1 1 1 I I II I I I I I I I I I MINIMI NIMIMM II IMMMI 
1116 ATCGGGGTCTTGGCCTACATTCTCCAGAGGGAGATCCATGAACCCGAGTG 1165 

2201 CAGGCACCTGTCCAGGAAGTTCACCGAGTGGGCCTACGGGCCCGTGCACT 2250 

I | Mill 1 1 1 1 1 f I I I f 1. 1 1 1 1 1 ■ MINIM Mill 'Ml 1 M 
1166 CCGACACCTATCCAGGAAGTTCACCGAATGGGCCTATGGGCCAGTGCACT 1215 

CCTCGCTGTACGACCTGTCCTGCATCGACACCTGCGAGAAGAACTCGGTG 2300 

I II I I I I I I I I I I I I I I M I Ml M II MM II II Ml 

CCTCCCTTTATGACCTGTCCTGCATTGACACCTGTGAAAAGAACTCGGTT 1265 

CTGGAGGTGATCGCCTACAGCAGCAGCGAGACCCCTAATCGCCACGACAT 2350 

till I I II I II MM I Mill III I M M M MIM 

CTGGAGGTGATCGCTTACAGCAGCAGTGAGACCCCTAACCGTCATGACAT 1315 

2351 GCTCTTGGTGGAGCCGCTGAACCGACTCXrTGCAGGACAAGTGGGACAGAT 2400 

Ml I li Mill I I-I I I I I I I I I I I 1 1 I 

1316 GCTTCTCGTGGAACCCTTGAACCGACTCCTACAGGACAAGTGGGACAGAT 1365 

• * * 

2401 TCGTCAAGCGCATCTTCTACTTCAACTTCCTGGTCTACTGCCTGTACATG 2450 

I M I Ml I III II I II I I I I M ill I M I II M II I M I I II III 
1366 TTGTCAAGCGCATCTTCTACTTCAACITCTTCGTCTACTGCTTGTATATG 1415 

2451 ATCATCTTCACCATGGCTGCCTACTACAGGCCCGTGGATGGCTTGCCTCC "2500 

MHII, IMIIIIIMM MM Mill IMMMI M 

1416 ATCATCTTCACCGCGGCTGCCTACTATCGGCCTGTGGAAGGCTTGCCCCC 1465 

2501 CTTTAAGATGGAAAAAA. . .TTGGAGACTATTTCCGAGTTACTGGAGAGA 2547 
2501 CTTTAA (|| ( |( |||j|M 

1466 CTATAAGCTGAAAAACACCGTTGGGGACTATTTCCGAGTCACCGGAGAGA 1515 

2548 TCCTGTCTGTGTTAGGAGGAGTCTACTTCTTTTTCCGAGGGATTCAGTAT 2597 

I I | fll I II I I I II 1 1 M I M 1 1 1 1 1 1 1 1 Ml MHII Ml 

1516 TCTTGTCTGTGTC3WKSAGGAGTCTACTTCTTCTTCCGAGGGATTCAATAT 1565 

• * 

25 98 TTCCTGCAGAGGCGGCCGTCGATGAAGACCCTGTTTGTGGACAGCTACAG 2647 

MmiMMIMIM M I MM IIMIMIIMMIIIMI 
1566 TTCCTGWGAGGCGACCATOCCTCAAGAGTITGTTTGTGGACAGCTACAG 1615 

. • • * 

2648 TGAGATGCTTTTCTTTCTGCAGTCACTGTTCATGCTGGCCACCGTGGTGC 2697 

, MM Ml 1 1 I Mill llll II Mil I II I "MM 

1616. TGAGATACTTTTCTTT6TACAGTCGCTGTTCATGCTGGTGTCTGTGGTAC 1665 

• • * * 

2698 TGTACTTCAGCCAOCTCAAGGASTATGTGGCTTCCATGGTATTCTCCCTG 2747 

IMIMMIMM I 1 1 1 M I II M I M M M M MMIMM 

1666 TGTACTTCAGCCAACGCAAGGAGTATGTGGCTTCCATGGTGTTCTCCCTG 1715 

2748 GCCTTGGGCTGGACCAACATGCTCTACTACACCCGCGGTTTCCAGCAGAT 2797 

III Ml MM I Ml M "MINIMI 

1716 GCCATGGGCTGGACCAACATGCTCTACTATACCCGAGGATTCCAGCAGAT 1765 
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2798 GGGCATCTATGCCGTCATGATAGAGAAGATGATCCTGAGAGACCTGTGCC 2847 

NIIIIIMIIt llllllll MIIIIIII Mill IIHtllMMM 
1766 GGGCATCTATGCTGTCATGATTGAGAAGATGATCCTCAGAGACCTGTGCC 1815 

2848 GTTTCATGTTTGTCTACATCGTCTTCTTGTTCGGGTTTTCCACAGCGGTG 2897 

! II Mill I I I I I I I I I I IMIIII! tl IIMilllllf I I f 
1816 GGTTTATGTTCGTCTACCTCGTGTTCTTGTTTGGATTTTCCACAGCTGTG 1865 

2898 GTGACGCTGATTGAAGACGGGAAGAATGACTCCCTGCCGTCTGAGTCCAC 2947 

Mill llllllll II MIIIIIII MM Mill llllllll 
1866 GTGACACTGATTGAGGATGGG AAGAATAACTCTCTGCCTATGGAGTCCAC 1915 

• m « > * 

2948 GTCGCACAGGTGGCGGGGGCCTGCCTGCAGGCCCCCCGATAGCTCCTACA 2997 

I MM Ml IMM I Mill! Ml I M I II Ml MM 
1916 ACCACACAAGTGCCGGGGGTCTGCCTGCAAG . . . CCAGGTAACTCTTACA 1962 

2998 ACAGCCTGTACTCCACCTGCCTGGAGCTGTTCAAGTTCACCATCGGCATG 3047 

I II II I I II I I If II II II I II II I If II II I M II II M I II I I M 
1963 ACAGCCTGTATTCCACATGTCTGGAGCTGTTCAAGTTCACCATCGGCATG 2012 

3048 GGCGACCTGGAGTTCACTGAGAACTATGACTTCAAGGCTGTCTTCATCAT 3097 

I M M I II M II I II I I I II M I I II II II M II II M II II I M I M I 
2013 GGCGACCTGGAGTTCACTGAGAACTACGACTTCAAGGCTGTCTTCATCAT 2062 

• • * * • • 
3098 CCTGCTGCTGGCCTATGTAATTCTCACCTACATCCTCCTGCTCAACATGC 3147 

MM I MMIIIIMI Ml I M I I I II M I M I I I II I M 1 1 1 1 M 
2063 CCTGTTACTGGCCTATGTGATTCTCACCTACATCCTTCTGCTCAACATGC 2112 

• • • a • 

3148 TCATCGCCCTCATGGGTGAGACTGTCAACAAGATCGCACAGGAGAGCAAG 3197 

MM II I II II M II M I M M M I II M II Mill IIIMIMI 
2113 TCATTGCTCTCATGGGTGAGACCGTCAACAAGATTGCACAAGAGAGCAAG 2162 

• • • • • 
3198 AACATCTGGAAGCTGCAGAGAGCCATCACCATCCTGGACACGGAGAAGAG 3247 

I M I II II I M M II II M I 1 1 M II M II I i II II II II llllllll 
2163 AACATCTGGAAGCTGCAGAGAGCCATCACCATCCTGGATACAGAGAAGAG 2212 

• • • ■ * • 
3248 CTTCCTTAAGTGCATGAGGAAGGCCTTCCGCTCAGGCAAGCTGCTGCAGG 3297 

MUM 1 M II I II II I I M II M 1 1 1 M M I 111 MIIMMIIMI 
2213 CTTCCTGAAGTGCATGAGGAAGGCCTTCCGCTCTGGCAAGCTGCT 2262 

• • * • • 
3298 TGGGGTACACACCTGATGGCAAGGACGACTACCGGTGGTGCTTCAGGGTG 3347 

MMII III IMM llllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIMIMI 
2263 TGGGGTTCACTCCTGACGGCAAGGATGACTACCGGTGGTGTTTCAGGGTG 2312 

• * • ♦ • 
3348 GACGAGGTGAACTGGACCACCTGGAA^CCAACGTGGGCATCATCAACGA 3397 

llllllll llllllll MUM MM MM Mill MMIIIIMI 
2313 GACGAGGTAAACTGGACTACCTGGAACACCAATGTGGGTATCATCAACGA 2362 

• • • • • 
3399 AGACCCXK5GCAACTGTGAGGGCGTCAAGCGCACCCTGAGCTTCTC(XrrGC 3447 

Mill I II M I Ml III MMMM I II III II I II II II I II II 1 1 
2363 GGACCCAGGCAACTGTGAGGGCGTCAAGCGCACCCTGAGCTTCTCCCTGA 2412 

• • • • • 
3448 GGTCAAGCAGAGTTTCAGGCAGACACTGGAAGAACTTTGCCCTGGTCCCC 3497 

Mill II I 1 1 1 1 1 1 f II Ml Mill I III II II II I I I I III Ml 
2413 GGTCAGGCCGAGTTTCAGGGAGAAACTGGAAGAACTTTGCrCTGGTTCCC 2462 

• • ♦ # • 
3498 CTTTTAAGAGAGGCAAGTGCTCGAGATAGGCAGTCTGCTCAGCCCGAGGA 3547 
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2463 'iiciJ^^^ " 12 



354 s - r *-™ ;;;; 

2513 AGTTCAACTGAAGCATTATACGGGATCCCTTAAGCCAGAGGATGCTGAGG 2562 



3598 TCTTCAAGAGTCCTGCCGCTTCCGGGGAGAAGTGA . GGACGTCACGCAGA 3646 
I I II III II I I I I I I I 1 1 I I ■ MM I I Ml 



2612 
3696 



2563 TTTTCAAGGATTCCATGGTCCCAGGGGAGAAATAATGGACACTATGCAGG 
3647 CAGCACTGTCAACACTGGGCCTTAGGAGACCCCGTTGCCACGGGGGGCTG 

2613 GATCAATG CGGGGTCTTTGGGTGGTCTG 2640 

3697 CTGAGGGAACACCAGTGCTCr™ 3746 
2641 CTTAGGGAAC . CAGCAGGGTTGACGTTATCTGGGTCCACTCTGTGCCTGC 2689 

3747 CCA.GCATgWaAATcWtGGACAAG^ 3795 

. iiii i i i I I II I I I I I I M I M I I l * 
2690 CTAGG^CAWCCTAGGACTTCGGCGGGCCTGCTGTGGGAA . CTGGGAGG 2738 

379 6 GAAGCATGGGGAGTGATGTACATCCAACCGTCACTG 3.45 
2739 TGTGTGGGAATTGAGATGTGTATCCAACCATGA. . . TCTCCAAACATTTG 2785 

384 6 TCCTAACAGACTTTCAGGTWTTA^^ 3895 
2786 GCTTTCAACTCTWATGGACTTTATTAAACAGAGTGAATGGCAAATCTCT 2835 

3896 AGGGCGGCCGCTTA 3909 

I Ml 
2836 ACTTGGACACAT . . 2847 
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GAP of: humanvrl.pep check: 6877 from: 1 to: 839 
humanVRl _Fbhl8547pat - fchrb87a6, 3909 bases, 4554 checksum, 
to: ratvrl.pep check: 5764 from: 1 to: 838 

ratVRl | AF029310 Rattus norvegicus vanilloid receptor subtype 1 mRNA 
complete ' 
cds. 

Symbol comparison table: 

/ddm_local/gcg/gcg_9. l/9cgcore/data/rundata/blosum62.cmp 
CompCheck: 6430 

Gap Weight: 12 Average Match: 2.912 

Length Weight: 4 Average Mismatch: -2.003 

Quality: 3734 Length: 840 

Ratio: 4.456 Gaps: 3 

Percent Similarity: 89.247 Percent Identity: 86.022 

Match display thresholds for the alignment (s) : 
I = IDENTITY 
: « 2 
. - 1 

humanvrl.pep x ratvrl.pep 

• • • * . 

1 MKKWSSTDLGTAADPLQKDTCPDPLDGDPNSRPPPAKPQLPTAKSRTRLF 50 

I-. .1 I . I I ... I II I til : I I t It : I : I I I I M 
1 MEQRASLDSEESESPPQENSCLDPPDRDPNCKPPPVKPHIFTTRSRTRLF 50 

• * • • . 

51 GKGDSEEAFPVDCPHEEGELDSCPTITVSPVITIQRPGDGPTGARLLSQD 100 

Mtllllt I. 111:111 I III till 1:111111111 | ||| 
51 GKGDSEEASPLDCPYEEGGLASCPIITVSSVLTIQRPGDGPASVRPSSQD 100 

101 SVAASTEKTLRLYDRRSIFEAVAQNNCQDLESLLLFLQKSKKHLTDKEFK 150 

It. I II IIMIII M:INI. 111:11111 lll:||| III. Ill 
101 SVSAG . EKPPRLYDRRSI FDAVAQSNCQELESLLPFLQRSKKRLTDSEFK 149 

151 DPETGKTCLLKAMLNLHDGQNTT 1 PLLLE I ARQTDSLKELVNAS YTDS Y Y 200 

Illlllil MMMIII.IIf II IM::IMIMI: llllllllll 
150 DPETGKTCLLKAMLNIJINGQNDTIAIJiLDVARKTDSLKQFVNASYTDSYY 199 

• • • ■ m 
201 KGQTALH I AIERRNMALVTLLVENGADVQAAAHGDFFKKTKGRPGFYFGE 250 

1 1 I I 1 1 1 I I 1 1 1 1 1 I MM I IMMIIMM.I III II II If I I I I II I 
200 KGQTALH I AI ERRNMTLVTLL VENGADVQAAANG D FFKKTKGR PGF YFG E 249 

• • * • « 
251 LPLSLAACTNQI/SI \HCFIJ^QNSKQTADI SARDSVGKTVLHALVE VAOKTA 300 

1 M I 1 I t f I f 1 1 I I I J I I t J I I J IMIIIIM I III II I II I I If! i 
250 LPLS LAACTNQLAI VKFLLQNSWQ PAD I SARDS VGNTVLHALVEVADNTV 299 

301 DNTKFVTSMYNE I LMLGAKLH PTLKLEELTNKKGMTPLAIAAGTGKI GVL 350 

MMIIIIIf U 11. II I MM MM I I: II: II: I llllll .|||||| 
300 DOTKFVTSMYNEILILGAKLHPTLKLEEITNRKGLTPLALAASSGKIGVX 349 

• • * • , 

351 AYILQREIQEPECRHLSRKFTEWAYGPVHSSLYDLSCIDTCEKNSVLEVI 400 

Illlllil I M I I t I I I I I I I N I M II I I 1 1 I I I I II I I I | I I 1 1 | | | 
350 AYILQREIHEPECRHLSRKFTEWAYGPVHSSLYDLSCIDTCEKNSVLEVI 399 
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401 AYSSSETPNRHDMLLVEPLNRLLQDKWDRFVKRIFyFNFLVyCLYMIIFT 450 

t ■ ■ • 1 1 1 1 1 1 1 1 1 | I I 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ■ I > II I Mill II 
400 AYSSSETPNRHDMLLVEPLNRLliQDKWDRFVKRI FYFNFFVYCLYMII FT 449 

451 MAAYYRPVDGLPPFKMEK . IGDYFRVTGEILSVI^VYFFFRGIQYFLQR 499 

I | IIHCI III: l:. : I 1 1 1 1 1 1 1 1 1 M I M I I 1 1 1 I II I 1 1 1 1 1 
450 AAAYYRPVEGLPPYKLKNTVGDYFRVTGEILSVSGGVYFFFRGIQYFLQR 



499 



599 
599 



650 



500 RPSMKTLFVDSYSEMLFFWSLFMUVTWLYFSHLKEYVASMVFS1AWW 549 

1 1 I • I 1 1 1 1 1 1 1 I • 1 1 1 . I 1 1 1 1 1 • I M I M I I I 1 1 1 I M I II : 1 1 
500 RPSLKSLFVDSYSEILFFVQSLFMLVSWLYFSQ 549 

550 TNMLYYTRGFQQMGIYAVMIEKMILRDLCRFMFVYIVFLFGFSTAWTL^ 

550 TOMU™ IIM , M, ||||||:IMIIII II 

550 TNMLYYTRGFQQMGIYAVMIEKMILRDLCRFMFVYLVFLFGFSTAWTLI 

600 EDGKNDSLPSESTSHRWRGPACRPPDSSYNSLYSTCLELFKFTIGMGDLE 649 

I I I I I III III I : II II: I . I I I I I I I I M I I I I I I I I I I I I I 
600 EDGKNNSLPMESTPHKCRGSACK. PGNSYNSLYSTCLELFKFTIGMGDLE 648 

FTENYDFKAVFI Z LLLAYVI LTYILLJjNMLIALMGETVNKJAQESKNI WK 699 

MM MINI 1 1 1 I I Ml M I II 1 1 II I 1 1 I II I I II II I 

649 FTENYDF^VFIILLIAYVILTYIL^ « 8 

700 LQRAITILDTEKSFLKCMRKAFRSGKLWVGYTPDGKDDYRWCFRVDEVN 749 

700 WRAITI1.D1 | m | | I I I It I I II I I I M M I I M I I I I I I ao 

699 LQRAI T 1 LDTEKS F1«KCMRKAFRSGKLLQVG FTPDGKDDYRWCFRVDEVN 748 
750 WTTWNTWGilNEDPGNCEGVKRTLSFSLRSSRVSGRHW^FALyP^RE 799 

I I 1 1 1 1 1 1 1 1 1 II 1 1 M I M M 1 1 M M M I M I M . 1 1 1 I M M I II : 

74 9 WTWNTNVG 1 INEDPGNCEGVKRTLS FSLRSGRVSGRNWKN FALVPLLRD 798 

800 ASARDRQSAQPEEVYLRQFSGSLKPEDAEVFKSPAASGEK 839 

II III • I Ml I: t.MIIMMIIM M I 

799 ASTRDRHATQQEEVQLKHYTGSLKPEDAEVFKDSMVPGEK 838 
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CLUSTAL H <1.74) multiple sequence alignment 



human VR2 .alt 
human VR2 



KTSPSSSPVFRI^UJGOQEDGSEADRGKIJDFaSGLPPMBSOW 



human VR2. alt 
human VR2 



RKGTGASQPDPNRFDRDRLFHAVSRGVPEDIAGLPEYT^CT^ EYTEGSTGKTCL 



humanVR2.alt 
humanVR2 



MKAVLNLKDGVNACILPLLQIDRDSGNPQPLVNAQ^^ 



humanVR2 .alt GRFFQKGOTTCFTFGELPI^LAACTKCKDWSVItLENPHOPASLQA 

humanVR2 I^VENGAWHARACGRFTOKGOGTCFYTOELPLSLAACT 

humanVR2 . alt TOSQGNTVLHALVMISDNSAEKIALVTSMYTCU^AGARI^PTVQIiED IRNLQDLTPLKL 

humanVR2 TDSOGNTVlJiALVMI SDNSAEKIALVTSMYDGIJjOAGARLCP I RNLQDLTPLKL 

humanVR2 . alt AAXEGKX £ I FRHILQREFSG J-SHLiSRKFTEWCYGP VR VSLYDLAS VDS CEENSVLEI IAP 

humanVR2 AAXEGKIEI FRHIIiQRBPSGI^HLSRKFTEWCT^ IAF 

humanVR2 . alt HC3CS PHRHRMWLE PI/JKLLQAKWDLLI PKFFDNFLQJL I YKFIFTRVAYHQPTLKKQAA 

humanVR2 HCKSPHRHRMVVLEPIJfKLLQAKKDI^ 

humanVR2 . alt PHLKAEVGNSKLLTGHXLXLLTCIYLIA^ 

humanVR2 PHLKMTVGNSMLLTGHILILLGGIYL^^ 

humanVR2 . alt TVVSQVLCFI^EWYI^LLVSALVI^^ - 

humanVR2 TVV5QVIX3TAIBW«*m.VSALVX^^ 



humanVR2.alt 
humanVU 



XYIiVFIWPAVALVSLSQEAHRPEM^ 



humanVR2.alt 
faus&nVR2 • 



humanVR2.alt --KKiaSVLBKBIGFnmC^ 

humanVR2 KLQI^&VI^KEKQYHHCRlCKQRAOVKLTVlA'XJ^iiGS PDERHCFRVEKVHHASWBQTLPT 



humanVR2 . alt LCZDPSGAgVPRTI^KPVIASPPimBDQA^ 
humanVR2 LCEDPSOA£^RTI£KPVIJ^PPKEDEIX3ASEQ^^ 
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I i i ii i i i. i i i i i i i " i i i 
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B IfH H UK H181 WHIM Mill ll H 111 Ml »B*a.F» W .G.rn»r.Retaon 

b— hi — I — HHH-CHOTtD— CHJJf^ BBeta - Re ^ sCh -- F «— 

T I f 11 I II (1 1 i 1 I 11 I HM HfHH— "^ ^C^on 

T illlilllUlllllllHlllllDllHllI I ill mint 11 DT -^- c — 
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Protein Family / Domain Matches, HMMer version 2 

Searching for complete domains 



0uer y hVR-l 



EST f0 Ses 8 c e S , ptn„ fa,,Uly <-ore includes aU domains,: 

Score E-value N 



£Hh Ank repeat 



Parsed for domains: 
^!-— 8eq - f hmm-f hmm-t score E-value 

«* 3/3 333 361 .. } JJ jj ^ £ 

Alignments of top-scoring domains: 

«&t domain 1 of 3. from 201 to 233: score 34.4. E = 2.6e-06 
* - >nGnTPLHlAarygnvewklLLehGAdvnar tk< -• 
*G-»T-fLH*A ♦ n+ *v lL*e+GAdv a+ 



51.5 1.9e-ll 



bVR-1 201 KGQTALHIAIERJUJMALVTLLVENGADVQAAAH 233 
«*. domain 2 of , f - ^ 

FGEtPLSIAACTHOLGIVKFLLOnswOTADlSAROS 283 
«*» domain 3 of 3 from 333 to 36l! ^ 3 .«, , „ 

hVR-l 333 KGMTPLAIAAQTGKIOVLAYTLQ REIQEP 3€1 



WO 00/29577 



PCT/US99/26701 



32/35 




WO 00729577 



PCT/US99/2670J 



33/35 



Protein Family / Domain Matches, BMWer version 2 

Searching for complete domains 

taqpfem - search a single ccq ao/ainst HMM database 
HHMER 2.1*1 (Dec 2998) 

Copyright (C) 1992-1998 Washington University School ot fcediclne 
RKMER Lb freely distributed under the ClbO General Public Incense (OTLJ . 

EKK filet /pro6ydd&/seaanai/PFAH/p£em3 - 3/PCam 

Sequence file: /t^/oxf anal .579 .aa 

Oaoxyt *lfc21ell 

Scares fox ceguence femily classification (ccor© include ell doaaine} i 
tfodal Description Score k -value .K 

ok FF00023 Ack repeat 53 .7 4e-12 3 

Parsed for domains: 

Kodel Domain eeq-f scq-c hnaa-f haa-t score E-volue 

aflk 1/3 162 194 .. *1 33 M 36.3 l,7e-07 

Mk 2/3 208 243 1 33 (1 €.4 4.3 

«tme 3/3 293 328 1 33 tl 8.8 2.1 

Alignments of top-scoring d^amainai • - 

softs domain 1 ot 3, from 162 to 194: score 38.3. & « 1.7*-07 
•->oCctWiaiAaryetiVcvVU^ * 

P2*21ell 1C2 T&tolUB&XBXm^ 1*4 

«*fcc domain 2 or 3. £ron 308 to 243 « eoorc 6.4, 8 -4.3 

«->o3aXEimMr yvutfuyivVa tte« • thB&flvnaxttec>* 
Q Fti l&a 4* «-nvr 4UUBfrtM- a*-«- 
rUttlAll 208 W«K<SL*XS1»^^ 243 

t domain 3 of 3. from 293 to 328 1 scare 8-8, 2.1 

*^>ncnlglfll^ary u^vww>J Uj|je* • .bOacamaxtcc-* 

4 4TPZ. !Aa44«44«t ♦ 1*4+ O ♦ 4T 

Flh21ell 293 ^w^^*wt^r*rftmT^t4nr£BiABKr 328 
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CLUSTAL W (1.74) multiple sequence alignment 

hVR2?2?FL ^ffff^ 

u^ nV ? 2 RKGTGASOPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEYTEGSTrirrnr 

hVR2 .altFL ?^ASQPDP^FD^ 

!?^ nV ^^ MKAVLNLKDGVNACILPLLOIDRDSGNPQPLVM^OCTDDYYRGHSAJUHIAIETO 

hVR2 . al tFL KDGV1*A.0 X X^> P 1^X>0 1 DRDS Gl^F^OF*!^ VNAOCTTDO V V RGMS AX*M X A X Ef^RS LOOVK 

£^ nV ? 2 ™ LLVENGANVHARACGRFFOKGOGTCFYFGELPLSL^CTKOWDWSYLLENPHOPASLOA 

hVR2 .altFL ^^^^^i^^^^^^ir^?^"^^ i=* F*<? ICG<^^3 T'CT ^"C3 £?X^ PX^S X-*AA.CTT JC^^'W^^'WS X^X^»£TN' ^5 X^^>!a 

humanVR2 TDSQGmVLHALVMISDNSAENIALVTSMYTC^^ 

hVR2 . altFL TDS^^I^LVMISDNSAENIALVTSOT^^ 

humanVR2 AAKEGKIEIFTUIILQREFSGLSHLSRKFTBWCYGPVRV^^ 

hVR2 . altFL ^jp:GKIEIFRHII^REFSGLSHI£RXFTEWCYGPVRVSLro^ 

humanVR2 HCKS PHRHRMWLE PLNKLLQAKWDLLI PKFFLNFLCNL I YMFI FTAVA YHOPTLKKDAA 

hVR2 . altFL HCKS PHRHRMWLEPLNKLLQAKWDLLI PKFFLNFLCNLI YMFI FTAVAYHQPTLKKQAA 

humanVR2 PHLKAEVGNSMLLTGHI L I LLGGI YLLVGQLWYFWRRHVFI WI S FI DS YFEILFLFOALJ 

hVR2 . altFL PHLKAEVGNSMLLTGHI LI LLGGI YLLVGQLWFWRRHVFI WI SFIDSYFEI LFLFQALL 

^, U ^ nV ? 2 TWSQVI^FLAIEOTLPLLVSALVLGWIJ^LYYTRGFQHTGIYSW 

hVR2 . altFL TWSQVLCFlAIEWYLPIiVSALVIOrLNLLYYTRGFOHTGIYSVMIQK - - - - . 

humanVR2 IYl>VFLFGFAVALVSLSQEAWRPEAPTGPNATESVQPMEGQEDEGNGAOYRGILEASLEL 

X1VK2 < 3xt FL — — — — — — — — • — » — »««_«.__ — .__ i<i- ____ 

human VR2 FKFT I GMGELAFQEQLHFRGMVLLLLIA YVLLTY I LLLNML IALMS ETVNS VATDS WS I W 

tlVR2 • a 1 1 FL — — — — _ «.__«.__ — — — — 

humanVR2 KLQKAISVLEMENGYWWCRKKQRAGVMLWGTKPDGSPDERWCFRV^ 

hVR2 .altFL - - - KAI S VLEMENGYWWCRKKQRAGVMLTVGTKPDGS PDERWCFRVEEVNWASWEQTLPT 

humanVR2 LCEDPSGAGVPRTLENPVLASPPKEDEDGASEENYVPVQLLQSN 

hVR2 .altFL LCEDPSGAGVPRTLENPVLASPPKEDEDGASEENYVPVQLLQSN 
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SEQUENCE LISTING * 
<110> MILLENNIUM PHARMACEUTICALS, INC. 

<120> NOVEL MEMBERS OF THE CAPSAICIN/VANILLOID RECEPTOR 
FAMILY OF PROTEINS AND USES THEREOF 

<130> MNI-062CP2PC 



<140> 
<141> 












<150> 60/108,322 
<151> 1998-11-13 












<150> 60/114,078 
<151> 1998-12-28 












<150> 09/258,633 
<151> 1999-02-26 












<150> 09/421,134 
<151> 1999-10-19 












<160> 20 












<170> Patent In Ver. 2.0 










<210> 1 
<211> 3909 
<212> DNA 

<213> Homo sapiens 












<220> 
<221> CDS 

<222> (1113) . . (3629) 












<400> 1 

gtgagcgcaa cgcactgcgg 


gcagtgagcg 


l~ CI CI l« ^ w 


cgggcagtga 


gcgcaacgca 


60 


ctgcgggcag tgagcgcaac 


gcactgcggg 


cagtgagcgc 


aacgcactgc gggcagtgag 


120 


cgcaacgcac tgcgggcagt 


gagcgcaacg 


cactgcgggc 


agtgagcgca 


acgcacttgc 


180 


gggcagtgag cgcaacgcac 


tgcgggcagt 


gagcgcaacg 


cactgcgggc 


agtgagcgca 


240 


acgcactgcg ggcagtgagc 


gcaacgcact 


gcgggcagtg 


agcgcaacgc 


actgcgggca 


300 


gtgagcgcaa cgcactgcgg 


gcagtgagcg 


caacgcactg 


cgggcagtga 


gcgcaacgca 


360 


ctgcgggcag tgagcgcaac 


gcactgcggg 


cagtgagcgc 


aacgcactgc 


gggcagtgag 


420 


cgcaacgcac ttaatgtgag 


ttagctcact 


cattaggcac 


cccaggcttt 


acactttatg 


480 


cttccggctc gtatgttgtg 


tggaattgtg 


agcggataac 


aatttcacac 


aggaaacagc 


540 


tatgaccatg attacgccaa 


gctctaatac 


gactcactat 


agggaaagct 


ggtacgcctg 


600 
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caggtaccgg tccggaattc ccgggtcgac ccacgcgtcc gaa'aacacac ctctctgctg 660 

tgggaagact gtgcaatggc acagccgcag agcttggttt gggaggttga agtgctctgg 720 

ggagaattcg tagatcatcc tcagaaaagc cttgccctgg tgttctacca gaaaaacgtc 780 

tcccaatcac ccagaaaagc tgtccacagt agtcccccct tatccacggg tgtcactttc 840 

catgggttca gttatttgcg gtcaaccacg gtctgccaat attaaatgga aaattcttca 900 

aacagttccc aagttttccc ttgtgcattg ttctgagcag tgtgatgaag agtctctgcc 960 

gtgccatctg ggatgcaaac cgtccctgtg tcccccacgt ccaggccgta gatgctcccc 1020 

gccggtcagt cacttagtcg tcagatcgcc cgtcctggta tcacagtgct tctgttcagg 1080 

ttgcacactg ggccacagag gatccagcaa gg atg aag aaa tgg age age aca 1133 

Met Lys Lys Trp Ser Ser Thr 
1 5 

gac ttg ggg aca get gcg gac cca etc caa aag gac acc tgc cca gac 1181 
Asp Leu Gly Thr Ala Ala Asp Pro Leu Gin Lys Asp Thr Cys Pro Asp 
10 15 20 

ccc ctg gat gga gac cct aac tec agg cca cct cca gee aag ccc cag 1229 
Pro Leu Asp Gly Asp Pro Asn Ser Arg Pro Pro Pro Ala Lys Pro Gin 
25 30 35 

etc ccc acg gee aag age cgc acc egg etc ttt ggg aag ggt gac teg 1277 
Leu Pro Thr Ala Lys Ser Arg Thr Arg Leu Phe Gly Lys Gly Asp Ser 
40 45 50 55 

gag gag get ttc ccg gtg gat tgc ccc cac gag gaa ggt gag ttg gac 1325 
Glu Glu Ala Phe Pro Val Asp Cys Pro His Glu Glu Gly Glu Leu Asp 
60 65 70 

tec tgc ccg acc ate aca gtc age cct gtt ate acc ate cag agg cca 1373 
Ser Cys Pro Thr lie Thr Val Ser Pro Val lie Thr lie Gin Arg Pro 
75 80 85 

gga gac ggc ccc acc ggt gee agg ctg ctg tec cag gac tct gtc gec 1421 
Gly Asp Gly Pro Thr Gly Ala Arg Leu Leu Ser Gin Asp Ser Val Ala 
90 95 100 

gee age acc gag aag acc etc agg etc tat gat cgc agg agt ate ttt 14 69 
Ala Ser Thr Glu Lys Thr Leu Arg Leu Tyr Asp Arg Arg Ser lie Phe 
105 110 115 

gaa gec gtt get cag aat aac tgc cag gat ctg gag age ctg ctg etc 1517 
Glu Ala Val Ala Gin Asn Asn Cys Gin Asp Leu Glu Ser Leu Leu Leu 
120 125 130 135 

ttc ctg cag aag age aag aag cac etc aca gac aac gag ttc aaa gac 1565 
Phe Leu Gin Lys Ser Lys Lys His Leu Thr Asp Asn Glu Phe Lys Asp 
140 145 150 

cct gag aca ggg aag acc tgt ctg ctg aaa gec atg etc aac ctg cac 1613 
Pro Glu Thr Gly Lys Thr Cys Leu Leu Lys Ala Met Leu Asn Leu His 
155 160 165 
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gcc ctg gtg acc etc ctg gtg gag aac gga gca gac gtc cag get gcg 
Ala Leu Val Thr Leu Leu Val Glu Asn Gly Ala Asp Val Gin Ala Ala 
220 225 230 

gcc cat ggg gac ttc ttt aag aaa acc aaa ggg egg cct gga ttc tac 
Ala His Gly Asp Phe Phe Lys Lys Thr Lys Gly Arg Pro Gly Phe Tyr 
235 240 245 

ttc ggt gaa ctg ccc ctg tec ctg gcc gcg tgc acc aac cag ctg ggc 
Phe Gly Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Asn Gin Leu Gly 
250 255 260 

ate gtg aag ttc ctg ctg cag aac tec tgg cag acg gcc gac ate age 
lie Val Lys Phe Leu Leu Gin Asn Ser Trp Gin Thr Ala Asp He Ser 
265 270 275 



1709 



gac gga cag aac acc acc ate ccc ctg etc ctg gag ate gcg egg caa 1661 

Asp Gly Gin Asn Thr Thr He Pro Leu Leu Leu Glu He Ala Arg Gin 
170 175 180 

acg gac age ctg aag gag ctt gtc aac gcc age tac acg gac age tac 

Thr Asp Ser Leu Lys Glu Leu Val Asn Ala Ser Tyr Thr Asp Ser Tyr 
185 190 195 

tac aag ggc cag aca gca ctg cac ate gcc ate gag aga cgc aac atg 1757 

Tyr Lys Gly Gin Thr Ala Leu His He Ala He Glu Arg Arg Asn Met 

200 205 210 215 



1805 



1853 



1901 



1949 



gcc agg gac teg gtg ggc aac acg gtg ctg cac gcc ctg gtg gag gtg 1997 
Ala Arg Asp Ser Val Gly Asn Thr Val Leu His Ala Leu Val Glu Val 
280 285 290 295 

gcc gac aac acg gcc gac aac acg aag ttt gtg acg age atg tac aat 2045 
Ala Asp Asn Thr Ala Asp Asn Thr Lys Phe Val Thr Ser Met Tyr Asn 
300 305 310 

gag att ctg atg ctg ggg gcc aaa ctg cac ccg acg ctg aag ctg gag 
Glu He Leu Met Leu Gly Ala Lys Leu His Pro Thr Leu Lys Leu Glu 
315 320 325 

gag etc acc aac aag aag gga atg acg ccg ctg get ctg gca get ggg 2141 
Glu Leu Thr Asn Lys Lys Gly Met Thr Pro Leu Ala Leu Ala Ala Gly 
330 335 340 



2093 



acc ggg aag ate ggg gtc ttg gcc tat att etc cag egg gag ate cag 2189 
Thr Gly Lys He Gly Val Leu Ala Tyr He Leu Gin Arg Glu He Gin 
345 350 355 

gag ccc gag tgc agg cac ctg tec agg aag ttc acc gag tgg gcc tac 2237 

Glu Pro Glu Cys Arg His Leu Ser Arg Lys Phe Thr Glu Trp Ala Tyr 

360 365 370 375 

ggg ccc gtg cac tec teg ctg tac gac ctg tec tgc ate gac acc tgc 

Gly Pro Val His Ser Ser Leu Tyr Asp Leu Ser Cys He Asp Thr Cys 
380 385 390 

gag aag aac teg gtg ctg gag gtg ate gcc tac age age age gag acc 2333 

Glu Lys Asn Ser Val Leu Glu Val He Ala Tyr Ser Ser Ser Glu Thr 



2285 
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395 400 405 

cct aat cgc cac gac atg etc ttg gtg gag ccg ctg aac cga etc ctg 2381 
Pro Asn Arg His Asp Met Leu Leu Val Glu Pro Leu Asn Arg Leu Leu 
410 415 420 



cag gac aag tgg gac aga ttc gtc aag cgc ate ttc tac ttc aac ttc 2429 
Gin Asp Lys Trp Asp Arg Phe Val Lys Arg lie Phe Tyr Phe Asn Phe 
425 430 435 

ctg gtc tac tgc ctg tac atg ate ate ttc ace atg get gee tac tac 2477 
Leu Val Tyr Cys Leu Tyr Met He He Phe Thr Met Ala Ala Tyr Tyr 
440 445 450 455 

agg ccc gtg gat ggc ttg cct ccc ttt aag atg gaa aaa att gga gac 2525 
Arg Pro Val Asp Gly Leu Pro Pro Phe Lys Met Glu Lys He Gly Asp 
460 465 470 

tat ttc cga gtt act gga gag ate ctg tct gtg tta gga gga gtc tac 2573 
Tyr Phe Arg Val Thr Gly Glu He Leu Ser Val Leu Gly Gly Val Tyr 
475 480 485 

ttc ttt ttc cga ggg att cag tat ttc ctg cag agg egg ccg teg atg 2621 
Phe Phe Phe Arg Gly He Gin Tyr Phe Leu Gin Arg Arg Pro Ser Met 
490 495 500 

aag acc ctg ttt gtg gac age tac agt gag atg ctt ttc ttt ctg cag 2669 
Lys Thr Leu Phe Val Asp Ser Tyr Ser Glu Met Leu Phe Phe Leu Gin 
505 510 515 

tea ctg ttc atg ctg gec acc gtg gtg ctg tac ttc age cac etc aag 2717 
Ser Leu Phe Met Leu Ala Thr Val Val Leu Tyr Phe Ser His Leu Lys 
520 525 530 535 

gag tat gtg get tec atg gta ttc tec ctg gec ttg ggc tgg acc aac 2765 
Glu Tyr Val Ala Ser Met Val Phe Ser Leu Ala Leu Gly Trp Thr Asn 
540 545 550 

atg etc tac tac acc cgc ggt ttc cag cag atg ggc ate tat gee gtc - 2813 
Met Leu Tyr Tyr Thr Arg Gly Phe Gin Gin Met Gly He Tyr Ala Val 
555 560 565 

atg ata gag aag atg ate ctg aga gac ctg tgc cgt ttc atg ttt gtc 2861 
Met He Glu Lys Met He Leu Arg Asp Leu Cys Arg Phe Met Phe Val 
570 575 580 

tac ate gtc ttc ttg ttc ggg ttt tec aca gcg gtg gtg acg ctg att 2909 
Tyr He Val Phe Leu Phe Gly Phe Ser Thr Ala Val Val Thr Leu He 
585 590 595 

gaa gac ggg aag aat gac tec ctg ccg tct gag tec acg teg cac agg 2957 
Glu 'Asp Gly Lys Asn Asp Ser Leu Pro Ser Glu Ser Thr Ser His Arg 
600 605 610 615 

tgg egg ggg cct gee tgc agg ccc ccc gat age tec tac aac age ctg 3005 
Trp Arg Gly Pro Ala Cys Arg Pro Pro Asp Ser Ser Tyr Asn Ser Leu 
620 625 630 
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tac tec acc tgc ctg gag ctg ttc aag ttc acc ate ggc atg ggc gac 

Tyr Ser Thr Cys Leu Glu Leu Ph Lys Ph Thr lie Gly Met Gly Asp 
635 640 645 

ctg gag ttc act gag aac tat gac ttc aag get gtc ttc ate ate ctg 

Leu Glu Phe Thr Glu Asn Tyr Asp Phe Lys Ala Val Phe lie He Leu 
650 655 660 



3053 



3101 



ctg ctg gec tat gta att etc acc tac ate etc ctg etc aac atg etc 
Leu Leu Ala Tyr Val He Leu Thr Tyr He Leu Leu Leu Asn Met Leu 
665 670 675 

ate gec etc atg ggt gag act gtc aac aag ate gca cag gag age aag 
He Ala Leu Met Gly Glu Thr Val Asn Lys He Ala Gin Glu Ser Lys 
680 685 690 695 

aac ate tgg aag ctg cag aga gee ate acc ate ctg gac acg gag aag 
Asn He Trp Lys Leu Gin Arg Ala He Thr He Leu Asp Thr Glu Lys 
700 705 710 

age ttc ctt aag tgc atg agg aag gee ttc cgc tea ggc aag ctg ctg 
Ser Phe Leu Lys Cys Met Arg Lys Ala Phe Arg Ser Gly Lys Leu Leu 
715 720 725 



3149 



3197 



3245 



3293 



cag gtg ggg tac aca cct gat ggc aag gac gac tac egg tgg tgc ttc 
Gin Val Gly Tyr Thr Pro Asp Gly Lys Asp Asp Tyr Arg Trp Cys Phe 
730 735 740. 

agg gtg gac gag gtg aac tgg acc acc tgg aac acc aac gtg ggc ate 
Arg Val Asp Glu Val Asn Trp Thr Thr Trp Asn Thr Asn Val Gly He 
745 750 755 

ate aac gaa gac ccg ggc aac tgt gag ggc gtc aag cgc acc ctg age 
He Asn Glu Asp Pro Gly Asn Cys Glu Gly Val Lys Arg Thr Leu Ser 
760 765 770 775 

ttc tec ctg egg tea age aga gtt tea ggc aga cac tgg aag aac ttt 
Phe Ser Leu Arg Ser Ser Arg Val Ser Gly Arg His Trp Lys Asn Phe 
780 785 790 

gec ctg gtc ccc ctt tta aga gag gca agt get cga gat agg cag tct 
Ala Leu Val Pro Leu Leu Arg Glu Ala Ser Ala Arg Asp Arg Gin Ser 
795 800 805 

get cag ccc gag gaa gtt tat ctg cga cag ttt tea ggg tct ctg aag 
Ala Gin Pro Glu Glu Val Tyr Leu Arg. Gin Phe Ser Gly Ser Leu Lys 
810 815 820 



3341 



3389 



3437 



3485 



3533 



3581 



cca gag gac get gag gtc ttc aag agt cct gec get tec ggg gag aag 
Pro Glu Asp Ala Glu Val Phe Lys Ser Pro Ala Ala Ser Gly Glu Lys 
825 830 835 



3629 



tgaggacgtc aegcagacag cactgtcaac actgggcett aggagacccc gttgecaegg 3689 
.ggggctgctg agggaacacc agtgctctgt cagcagcctg gcctggtctg tgcctgccca 374 9 
gcatgttccc aaatctgtgc tggacaagct gtgggaagcg ttcttggaag catggggagt 3809 
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gatgtacatc caaccgtcac tgtccccaag tgaatctcct aacagacttt caggttttta 3869 
ctcactttac taaaaaaaaa aaaaaaaggg cggccgctta 3909 



<210> 2 
<211> 839 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Lys Lys Trp Ser Ser Thr Asp Leu Gly Thr Ala Ala Asp Pro Leu 
! 5 10 15 

Gin Lys Asp Thr Cys Pro Asp Pro Leu Asp Gly Asp Pro Asn Ser Arg 
20 25 30 

Pro Pro Pro Ala Lys Pro Gin Leu Pro Thr Ala Lys Ser Arg Thr Arg 
35 40 45 

Leu Phe Gly Lys Gly Asp Ser Glu Glu Ala Phe Pro Val Asp Cys Pro 
50 55 60 

His Glu Glu Gly Glu Leu Asp Ser Cys Pro Thr He Thr Val Ser Pro 
65 70 75 80 

Val He Thr He Gin Arg Pro Gly Asp Gly Pro Thr Gly Ala Arg Leu 
85 90 95 

Leu Ser Gin Asp Ser Val Ala Ala Ser Thr Glu Lys Thr Leu Arg Leu 
100 105 HO 

Tyr Asp Arg Arg Ser He Phe Glu Ala Val Ala Gin Asn Asn Cys Gin 
115 120 125 

Asp Leu Glu Ser Leu Leu Leu Phe Leu Gin Lys Ser Lys Lys His Leu 
130 135 140 

Thr Asp Asn Glu' Phe Lys Asp Pro Glu Thr Gly Lys Thr Cys Leu Leu 
145 150 155 160 

Lys Ala Met Leu Asn Leu His Asp Gly Gin Asn Thr Thr He Pro Leu 
165 170 175 

Leu Leu Glu He Ala Arg Gin Thr Asp Ser Leu Lys Glu Leu Val Asn 
180 185 190 

Ala Ser Tyr Thr Asp Ser Tyr Tyr Lys Gly Gin Thr Ala Leu His He 
195 200 205 

Ala He Glu Arg Arg Asn Met Ala Leu Val Thr Leu Leu Val Glu Asn. 
210 215 220 
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Gly Ala Asp Val Gin Ala Ala Ala 
225 230 

Lys Gly Arg Pro Gly Phe Tyr Ph 
245 

Ala Cys Thr Asn Gin Leu Gly lie 
260 

Trp Gin Thr Ala Asp lie Ser Ala 
275 280 



His Gly Asp Phe Phe Lys Lys Thr 
235 240 

Gly Glu Leu Pro Leu Ser Leu Ala 
250 255 

Val Lys Phe Leu Leu Gin Asn Ser 
265 270 

Arg Asp Ser Val Gly Asn Thr Val 
285 



Leu His Ala Leu Val Glu Val Ala Asp Asn Thr Ala Asp Asn Thr Lys 
290 295 300 

Phe Val Thr Ser Met Tyr Asn Glu He Leu Met Leu Gly Ala Lys Leu 
305 310 315 320 

His Pro Thr Leu Lys Leu Glu Glu Leu Thr Asn Lys Lys Gly Met Thr 
325 330 335 

Pro Leu Ala Leu Ala Ala Gly Thr Gly Lys He Gly Val Leu Ala Tyr 
340 345 350 

He Leu Gin Arg Glu He Gin Glu Pro Glu Cys Arg His Leu Ser Arg 
355 360 365 

Lys Phe Thr Glu Trp Ala Tyr Gly Pro Val His Ser Ser Leu Tyr Asp 
370 375 380 

Leu Ser Cys He Asp Thr Cys Glu Lys Asn Ser Val Leu Glu Val He 
385 390 395 400 

Ala Tyr Ser Ser Ser Glu Thr Pro Asn Arg His Asp Met Leu Leu Val 
405 410 415 

Glu Pro Leu Asn Arg Leu Leu Gin Asp Lys Trp Asp Arg Phe Val Lys 
420 425 430 

Arg He Phe Tyr Phe Asn Phe Leu Val Tyr Cys Leu Tyr Met He He 
435 440 445 

Phe Thr Met Ala Ala Tyr Tyr Arg Pro Val Asp Gly Leu Pro Pro Phe 
450 455 460 

Lys Met Glu Lys He Gly Asp Tyr Phe Arg Val Thr Gly Glu He Leu 
465 470 475 480 

Ser Val Leu Gly Gly Val Tyr Phe Phe Phe Arg Gly He Gin Tyr Phe 
485 490 495 

Leu Gin Arg Arg Pro Ser Met Lys Thr Leu Phe Val Asp Ser Tyr Ser 
500 505 510 

Glu Met Leu Phe Phe Leu Gin Ser Leu Phe Met Leu Ala Thr Val Val 
515 520 525 

Leu Tyr Phe Ser His Leu Lys Glu Tyr Val Ala Ser Met Val Phe Ser 
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530 



535 



540 



Leu Ala Leu Gly Trp Thr Asn Met Leu Tyr Tyr Thr Arg Gly Phe Gin 
545 550 555 560 

Gin Met Gly lie Tyr Ala Val Met lie Glu Lys Met lie Leu Arg Asp 
565 570 575 

Leu Cys Arg Phe Met Phe Val Tyr He Val Phe Leu Phe Gly Phe Ser 
580 585 590 

Thr Ala Val Val Thr Leu He Glu Asp Gly Lys Asn Asp Ser Leu Pro 
595 600 605 



Ser Glu Ser Thr Ser His Arg Trp Arg Gly Pro Ala Cys Arg Pro Pro 
610 615 620 

Asp Ser Ser Tyr Asn Ser Leu Tyr Ser Thr Cys Leu Glu Leu Phe Lys 
625 630 635 640 

Phe Thr He Gly Met Gly Asp Leu Glu Phe Thr Glu Asn Tyr Asp Phe 
645 650 655 

Lys Ala Val Phe He He Leu Leu Leu Ala Tyr Val He Leu Thr Tyr 
660 665 670 

He Leu Leu Leu Asn Met Leu lie Ala Leu Met Gly Glu Thr Val Asn 
675 680 685 

Lys He Ala Gin Glu Ser Lys Asn He Trp Lys Leu Gin Arg Ala He 
690 695 700 

Thr He Leu Asp Thr Glu Lys Ser Phe Leu Lys Cys Met Arg Lys Ala 
705 710 715 720 

Phe Arg Ser Gly Lys Leu Leu Gin Val Gly Tyr Thr Pro Asp Gly Lys 
725 730 735 

Asp Asp Tyr Arg Trp Cys Phe Arg Val Asp Glu Val Asn Trp Thr Thr 
740 745 750 

Trp Asn Thr Asn Val Gly lie lie Asn Glu Asp Pro Gly Asn Cys Glu 
755 760 765 

Gly Val Lys Arg Thr Leu Ser Phe Ser Leu Arg Ser Ser Arg Val Ser 
770 775 780 

Gly Arg His Trp Lys Asn Phe Ala Leu Val Pro Leu Leu Arg Glu Ala 
7B5 790 795 800 

Ser Ala Arg Asp Arg Gin Ser Ala Gin Pro Glu Glu Val Tyr Leu Arg 
805 810 815 

Gin Phe Ser Gly Ser Leu Lys Pro Glu Asp Ala Glu Val Phe Lys Ser 
820 825 830 



Pro Ala Ala Ser Gly Glu Lys 
835 
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<210> 3 

<211> 2517 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) (2517) 

<400> 3 

atg aag aaa tgg age age aca gac ttg ggg aca get gcg gac cca etc 48 

Met Lys Lys Trp Ser Ser Thr Asp Leu Gly Thr Ala Ala Asp Pro Leu 
15 10 15 



caa aag gac ace tgc cca gac ccc ctg gat gga gac cct aac tec agg 96 
Gin Lys Asp Thr Cys Pro Asp Pro Leu Asp Gly Asp Pro Asn Ser Arg 
20 25 30 

cca cct cca gee aag ccc cag etc ccc acg gee aag age cgc acc egg 14 4 
Pro Pro Pro Ala Lys Pro Gin Leu Pro Thr Ala Lys Ser Arg Thr Arg 
35 40 45 

etc ttt ggg aag ggt gac teg gag gag get ttc ccg gtg gat tgc ccc 192 
Leu Phe Gly Lys Gly Asp Ser Glu Glu Ala Phe Pro Val Asp Cys Pro 
50 55 60 



cac gag gaa ggt gag ttg gac tec tgc ccg acc ate aca gtc age cct 
His Glu Glu Gly Glu Leu Asp Ser Cys Pro Thr He Thr Val Ser Pro 
65 70 75 80 



240 



gtt ate acc ate cag agg cca gga gac ggc ccc acc ggt gee agg ctg 288 
Val He Thr He Gin Arg Pro Gly Asp Gly Pro Thr Gly Ala Arg Leu 
85 90 95 

ctg tec cag gac tct gtc gec gee age acc gag aag acc etc agg etc 336 
Leu Ser Gin Asp Ser Val Ala Ala Ser Thr Glu Lys Thr Leu Arg Leu 
100 105 110 

tat gat cgc agg agt ate ttt gaa gee gtt get cag aat aac tgc cag 384 
Tyr Asp Arg Arg Ser He Phe Glu Ala Val Ala Gin Asn Asn Cys Gin 
115 120 125 

gat ctg gag age ctg ctg etc ttc ctg cag aag age aag aag cac etc 432 
Asp Leu Glu Ser Leu Leu Leu Phe Leu Gin Lys Ser Lys Lys His Leu 
130 135 140 



aca gac aac gag ttc aaa gac cct gag 

Thr Asp Asn Glu Phe Lys Asp Pro Glu 
145 150 

aaa gee atg etc aac ctg cac gac gga 

Lys Ala Met Leu Asn Leu His Asp Gly 
165 

etc ctg gag ate gcg egg caa acg gac 



aca ggg aag acc tgt ctg ctg 480 
Thr Gly Lys Thr Cys Leu Leu 
155 160 

cag aac acc acc ate ccc ctg 528 
Gin Asn Thr Thr He Pro Leu 
170 175 

age ctg aag gag ctt gtc aac 576 
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Leu Leu Glu lie Ala Arg Gin Thr Asp Ser Leu Lys Glu Leu Val Asn 
180 185 190 

gcc age tac acg gac age tac tac aag ggc cag aca gca ctg cac ate 624 
Ala Ser Tyr Thr Asp Ser Tyr Tyr Lys Gly Gin Thr Ala Leu His lie 
195 200 205 

gcc ate gag aga cgc aac atg gcc ctg gtg acc etc ctg gtg gag aac 672 
Ala lie Glu Arg Arg Asn Met Ala Leu Val Thr Leu Leu Val Glu Asn 
210 215 220 

gga gca gac gtc cag get gcg gcc cat ggg gac ttc ttt aag aaa acc 720 
Gly Ala Asp Val Gin Ala Ala Ala His Gly Asp Phe Phe Lys Lys Thr 
225 230 235 240 

aaa ggg egg cct gga ttc tac ttc ggt gaa ctg ccc ctg tec ctg gcc 768 
Lys Gly Arg Pro Gly Phe Tyr Phe Gly Glu Leu Pro Leu Ser Leu Ala 
245 250 255 

gcg tgc acc aac cag ctg ggc ate gtg aag ttc ctg ctg cag aac tec 816 
Ala Cys Thr Asn Gin Leu Gly lie Val Lys Phe Leu Leu Gin Asn Ser 
260 265 270 

tgg cag acg gcc gac ate age gcc agg gac teg gtg ggc aac acg gtg 864 
Trp Gin Thr Ala Asp lie Ser Ala Arg Asp Ser Val Gly Asn Thr Val 
275 280 285 

ctg cac gcc ctg gtg gag gtg gcc gac aac acg gcc gac aac acg aag 912 
Leu His Ala Leu Val Glu Val Ala Asp Asn Thr Ala Asp Asn Thr Lys 
290 295 300 

ttt gtg acg age atg tac aat gag att ctg atg ctg ggg gcc aaa ctg 960 
Phe Val Thr Ser Met Tyr Asn Glu He Leu Met Leu Gly Ala Lys Leu 
305 310 315 320 

cac ccg acg ctg aag ctg gag gag etc acc aac aag aag gga atg acg 1008 
His Pro Thr Leu Lys Leu Glu Glu Leu Thr Asn Lys Lys Gly Met Thr 
325 330 335 

ccg ctg get ctg gca get ggg acc ggg aag ate ggg gtc ttg gcc tat 1056 
Pro Leu Ala Leu Ala Ala Gly Thr Gly Lys He Gly Val Leu Ala Tyr 
340 345 350 

att etc cag egg gag ate cag gag ccc gag tgc agg cac ctg tec agg 1104 
He Leu Gin Arg Glu He Gin Glu Pro Glu Cys Arg His Leu Ser Arg 
355 360 365 

aag ttc acc gag tgg gcc tac ggg ccc gtg cac tec teg ctg tac gac 1152 
Lys Phe Thr Glu Trp Ala Tyr Gly Pro Val His Ser Ser Leu Tyr Asp 
370 375 380 

ctg tec tgc ate gac acc tgc gag aag aac teg gtg ctg gag gtg ate 1200 
Leu Ser Cys lie Asp Thr Cys Glu Lys Asn Ser Val Leu Glu Val lie 
385 390 395 400 

gcc tac age age age. gag acc cct aat cgc cac gac atg etc ttg gtg 1248 
Ala Tyr Ser Ser Ser Glu Thr Pro Asn Arg His Asp Met Leu Leu Val 
405 410 415 
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gag ccg ctg aac cga etc ctg cag gac aag tgg gac aga ttc gtc aag 
Glu Pro L u Asn Arg Leu Leu Gin Asp Lys Trp Asp Arg Phe Val Lys 
420 425 430 

cgc ate ttc tac ttc aac ttc ctg gtc tac tgc ctg tac atg ate ate 
Arg He Phe Tyr Phe Asn Phe Leu Val Tyr Cys Leu Tyr Met He He 
435 440 445 

ttc acc atg get gee tac tac agg ccc gtg gat ggc ttg cct ccc ttt 
Phe Thr Met Ala Ala Tyr Tyr Arg Pro Val Asp Gly Leu Pro Pro Phe 
450 . 455 460 

aag atg gaa aaa att gga gac tat ttc cga gtt act gga gag ate ctg 
Lys Met Glu Lys He Gly Asp Tyr Phe Arg Val Thr Gly Glu He Leu 
465 470 475 480 

tct gtg tta gga gga gtc tac ttc ttt ttc cga ggg att cag tat ttc 
Ser Val Leu Gly Gly Val Tyr Phe Phe Phe Arg Gly He Gin Tyr Phe 
485 490 495 



gag atg ctt ttc ttt ctg cag tea ctg ttc atg ctg gec acc gtg gtg 
Glu Met Leu Phe Phe Leu Gin Ser Leu Phe Met Leu Ala Thr Val Val 
515 520 525 

ctg tac ttc age cac etc aag gag tat gtg get tec atg gta ttc tec 
Leu Tyr Phe Ser His Leu Lys Glu Tyr Val Ala Ser Met Val Phe Ser 
530 535 540 

ctg gee ttg ggc tgg acc aac atg etc tac tac acc cgc ggt ttc cag 
Leu Ala Leu Gly Trp Thr Asn Met Leu Tyr Tyr Thr Arg Gly Phe Gin 
545 550 555 560 

cag atg ggc ate tat gee gtc atg ata gag aag atg ate ctg aga gac 
Gin Met Gly He Tyr Ala Val Met He Glu Lys Met lie Leu Arg Asp 
565 570 575 

ctg tgc cgt ttc atg ttt gtc tac ate gtc ttc ttg ttc ggg ttt tec 
Leu Cys Arg Phe Met Phe Val Tyr He Val Phe Leu Phe Gly Phe Ser 
580 585 590 



aca gcg gtg gtg acg ctg att gaa gac ggg aag aat gac tec ctg ccg 
Thr Ala Val Val Thr Leu He Glu Asp Gly Lys Asn Asp Ser Leu Pro 
595 600 605 

tct gag tec acg teg cac agg tgg egg ggg cct gee tgc agg ccc ccc 
Ser Glu Ser Thr Ser His Arg Trp Arg Gly Pro Ala Cys Arg Pro Pro 
610 615 620 

gat age tec tac aac age ctg tac tec acc tgc ctg gag ctg ttc aag 
Asp Ser Ser Tyr Asn Ser Leu Tyr Ser Thr Cys Leu Glu Leu Phe Lys 
625 630 635 640 

ttc acc ate ggc atg ggc gac ctg gag ttc act gag aac tat gac ttc 



1296 



1344 



1392 



1440 



1468 



ctg cag agg egg ccg teg atg aag acc ctg ttt gtg gac age tac agt 1536 
Leu Gin Arg Arg Pro Ser Met Lys Thr Leu Phe Val Asp Ser Tyr Ser 
500 505 510 



1584 



1632 



1680 



1728 



1776 



1824 



1872 



1920 



1968 
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Phe Thr He Gly M t Gly Asp L u Glu Phe Thr Glu Asn Tyr Asp Phe 
64 $ 650 655 

aag get gtc ttc ate ate ctg ctg ctg gec tat gta att etc acc tac 2016 
Lys Ala Val Phe He He Leu Leu Leu Ala Tyr Val He Leu Thr Tyr 
66 ° 665 670 

ate etc ctg etc aac atg etc ate gee etc atg ggt gag act gtc aac 2064 
He Leu Leu Leu Asn Met Leu He Ala Leu Met Gly Glu Thr Val Asn 
675 680 685 

aag ate gca cag gag age aag aac ate tgg aag ctg cag aga gec ate 2112 
Lys lie Ala Gin Glu Ser Lys Asn He Trp Lys Leu Gin Arg Ala He 
690 695 700 

acc ate ctg gac acg gag aag age ttc ctt aag tgc atg agg aag gee 2160 
Thr He Leu Asp Thr Glu Lys Ser Phe Leu Lys Cys Met Arg Lys Ala 
705 710 715 720 

ttc cgc tea ggc aag ctg ctg cag gtg ggg tac aca cct gat ggc aag 2208 
Phe Arg Ser Gly Lys Leu Leu Gin Val Gly Tyr Thr Pro Asp Gly Lys 
? 25 730 735 

gac gac tac egg tgg tgc ttc agg gtg gac gag gtg aac tgg acc acc 2256 
Asp Asp Tyr Arg Trp Cys Phe Arg Val Asp Glu Val Asn Trp Thr Thr 
740 745 750 

tgg aac acc aac gtg ggc ate ate aac gaa gac ccg ggc aac tgt gag 2304 
Trp Asn Thr Asn Val Gly He lie Asn Glu Asp Pro Gly Asn Cys Glu 
755 760. 765 

ggc gtc aag cgc acc ctg age ttc tec ctg egg tea age aga gtt tea 2352 
Gly Val Lys Arg Thr Leu Ser Phe Ser Leu Arg Ser Ser Arg Val Ser 
770 775 780 

ggc aga cac tgg aag aac ttt gee ctg gtc ccc ctt tta aga gag gca 2400 
Gly Arg His Trp Lys Asn Phe Ala Leu Val Pro Leu Leu Arg Glu Ala 
785 790 795 800 

agt get cga gat agg cag tct get cag ccc gag gaa gtt tat ctg cga 244 8 
Ser Ala Arg Asp Arg Gin Ser Ala Gin Pro Glu Glu Val Tyr Leu Arg 
805 810 815 



cag ttt tea ggg tct ctg aag cca gag gac get gag gtc ttc aag agt 24 96 
Gin Phe Ser Gly Ser Leu Lys Pro Glu Asp Ala Glu Val Phe Lys Ser 
820 825 830 

cct gee get tec ggg gag aag 2517 
Pro Ala Ala Ser Gly Glu Lys 
835 

<210> 4 

<211> 2809 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
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<222> (361) . . (2652) 
<400> 4 

ggctagcctg tcctgacagg ggagagttaa gctcccgttc tccaccgtgc cggctggcca 60 

ggtgggctga gggtgaccga gagaccagaa cctgcttgct ggagcttagt gctcagagct 120 

ggggagggag gttccgccgc tcctctgctg tcagcgccgg cagcccctcc cggcttcact 180 

tcctcccgca gcccctgcta ctgagaagct ccgggatccc agcagccgcc acgccctggc 240 

ctcagcctgc ggggctccag tcaggccaac accgacgcgc agctgggagg aagacaggac 300 

ccttgacatc tccatctgca cagaggtcct ggctggaccg agcagcctcc tcctcctagg 360 



atg acc tea ccc tec age tct cca gtt ttc agg ttg gag aca tta gat 
Met Thr Ser Pro Ser Ser Ser Pro Val Phe Arg Leu Glu Thr Leu Asp 
15 10 15 



408 



gga ggc caa gaa gat ggc tct gag gcg gac aga gga aag ctg gat ttt 456 
Gly Gly Gin Glu Asp Gly Ser Glu Ala Asp Arg Gly Lys Leu Asp Phe 
20 25 30 



ggg age ggg ctg cct ccc atg gag tea cag ttc cag ggc gag gac egg 504 
Gly Ser Gly Leu Pro Pro Met Glu Ser Gin Phe Gin Gly Glu Asp Arg 
35 40 45 

aaa ttc gec cct cag ata aga gtc aac etc aac tac cga aag gga aca 
Lys Phe Ala Pro Gin He Arg Val Asn Leu Asn Tyr Arg Lys Gly Thr 
50 55 60 

ggt gec agt cag ccg gat cca aac cga ttt gac cga gat egg etc ttc 
Gly Ala Ser Gin Pro Asp Pro Asn Arg Phe Asp Arg Asp Arg Leu Phe 
65 70 75 80 

aat gcg gtc tec egg ggt gtc ccc gag gat ctg get gga ctt cca gag 
Asn Ala Val Ser Arg Gly Val Pro Glu Asp Leu Ala Gly Leu Pro Glu 
85 90 95 

tac ctg age aag acc age aag tac etc acc gac teg gaa tac aca gag 
Tyr Leu Ser Lys Thr Ser Lys Tyr Leu Thr Asp Ser Glu Tyr Thr Glu 
100 105 110 

ggc tec aca ggt aag acg tgc ctg atg aag get gtg ctg aac ctt aag 744 
Gly Ser Thr Gly Lys Thr Cys Leu Met Lys Ala Val Leu Asn Leu Lys 
115 120 125 



gac gga gtc aat gee tgc att ctg cca ctg ctg cag ate gac agg gac 

Asp Gly Val Asn Ala Cys He Leu Pro Leu Leu Gin He Asp Arg Asp 
130 135 140 

tct ggc aat cct cag ccc ctg gta aat gee cag tgc aca gat gac tat 

Ser Gly Asn Pro Gin Pro Leu Val Asn Ala Gin Cys Thr Asp Asp Tyr 

145 150 155 160 

tac cga ggc cac age get ctg cac ate gee att gag aag agg agt ctg 

Tyr Arg Gly His Ser Ala Leu His He Ala He Glu Lys Arg Ser Leu 



552 



600 



648 



696 



792 



840 



888 
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165 170 175 

cag tgt gtg aag etc ctg gtg gag aat ggg gec aat gtg cat gec egg 936 
Gin Cys Val Lys Leu Leu Val Glu Asn Gly Ala Asn Val His Ala Arg 
180 185 190 

gec tgc ggc cgc ttc ttc cag aag ggc caa ggg act tgc ttt tat ttc 984 
Ala Cys Gly Arg Phe Phe Gin Lys Gly Gin Gly Thr Cys Phe Tyr Phe 
195 200 205 

ggt gag eta ccc etc tct ttg gec get tgc acc aag cag tgg gat gtg 1032 
Gly Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp Asp Val 
210 215 220 

gta age tac etc ctg gag aac cca cac cag ccc gee age ctg cag gee 1080 
Val Ser Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu Gin Ala 
225 230 235 240 

act gac tec cag ggc aac aca gtc ctg cat gee eta gtg atg ate teg 1128 
Thr Asp Ser Gin Gly Asn Thr Val Leu His Ala Leu Val Met lie Ser 
245 250 255 

gac aac tea get gag aac att gca ctg gtg acc age atg tat gat ggg 1176 
Asp Asn Ser Ala Glu Asn He Ala Leu Val Thr Ser Met Tyr Asp Gly 
260 265 270 



etc etc caa get ggg gec cgc etc tgc cct acc gtg cag ctt gag gac 1224 
Leu Leu Gin Ala Gly Ala Arg Leu Cys Pro Thr Val Gin Leu Glu Asp 
275 280 285 

ate cgc aac ctg cag gat etc acg cct ctg aag ctg gec gec aag gag 1272 
He Arg Asn Leu Gin Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu 
290 295 300 

ggc aag ate gag att ttc agg cac ate ctg cag egg gag ttt tea gga 1320 
Gly Lys He Glu He Phe Arg His He Leu Gin Arg Glu Phe Ser Gly 
305 310 315 320 

ctg age cac ctt tec cga aag ttc acc gag tgg tgc tat ggg cct gtc 1368 
Leu Ser His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val 
325 330 335 

egg gtg teg ctg tat gac ctg get tct gtg gac age tgt gag gag aac 1416 
Arg Val Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn 
340 345 350 

tea gtg ctg gag ate att gec ttt cat tgc aag age ccg cac cga cac 1464 
Ser Val Leu Glu lie lie Ala Phe His Cys Lys Ser Pro His Arg His 
355 360 365 

cga atg gtc gtt ttg gag ccc ctg aac aaa ctg ctg cag gcg aaa tgg 1512 
Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin Ala Lys Trp 
370 375 380 

gat ctg etc ate ccc aag ttc ttc tta aac ttc ctg tgt aat ctg ate 1560 
Asp Leu Leu lie Pro Lys Ph Phe Leu Asn Phe Leu Cys Asn Leu lie 
385 390 395 400 
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tac atg ttc ate ttc acc get gtt gee tac cat cag' cct ace ctg aag 1608 
Tyr M t Phe He Ph Thr Ala Val Ala Tyr His Gin Pro Thr Leu Lys 
405 410 415 



aag cag gec gee cct cac ctg aaa gcg gag gtt gga aac tec atg ctg 
Lys Gin Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu 
420 425 430 



aca gtg gtg tec cag gtg ctg tgt ttc ctg gec ate gag tgg tac ctg 
Thr Val Val Ser Gin Val Leu Cys Phe Leu Ala He Glu Trp Tyr Leu 
485 490 495 



1656 



ctg acg ggc cac ate ctt ate ctg eta ggg ggg ate tac etc etc gtg 1704 
Leu Thr Gly His He Leu He Leu Leu Gly Gly He Tyr Leu Leu Val 
435 440 445 

ggc cag ctg tgg tac ttc tgg egg cgc cac gtg ttc ate tgg ate teg 1752 
Gly Gin Leu Trp Tyr Phe Trp Arg Arg His Val Phe lie Trp He Ser 
450 455 460 

ttc ata gac age tac ttt gaa ate etc ttc ctg ttc cag gee ctg, etc 1800 
Phe He Asp Ser Tyr Phe Glu lie Leu Phe Leu Phe Gin Ala Leu Leu 
465 470 475 480 



1848 



ccc ctg ctt gtg tct gcg ctg gtg ctg ggc tgg ctg aac ctg ctt tac 1896 
Pro Leu Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr 
500 505 510 



tat aca cgt ggc ttc cag cac aca ggc ate tac agt gtc atg ate cag 1944 

Tyr Thr Arg Gly Phe Gin His Thr Gly He Tyr Ser Val Met He Gin 

515 520 525 

aag gtc ate ctg egg gac ctg ctg cgc ttc ctt ctg ate tac tta gtc 1992 

Lys Val lie Leu Arg Asp Leu Leu Arg Phe Leu Leu lie Tyr Leu Val 

530 535 540 

ttc ctt ttc ggc ttc get gta gec ctg gtg age ctg age cag gag get 2040 

Phe Leu Phe Gly Phe Ala Val Ala Leu Val Ser Leu Ser Gin Glu Ala 

545 550 555 560 



tgg cgc ccc gaa get cct aca ggc ccc aat gee aca gag tea gtg cag 2088 
Trp Arg Pro Glu Ala Pro Thr Gly Pro Asn Ala Thr Glu Ser Val Gin 
565 570 575 

ccc atg gag gga cag gag gac gag ggc aac ggg gee cag tac agg ggt 2136 
Pro Met Glu Gly Gin Glu Asp Glu Gly Asn Gly Ala Gin Tyr Arg Gly 
580 585 590 

ate ctg gaa gec tec ttg gag etc ttc aaa ttc acc ate ggc atg ggc 2184 
lie Leu Glu Ala Ser Leu Glu Leu Phe Lys Phe Thr lie Gly Met Gly 
595 600 605 

gag ctg gee ttc cag gag cag ctg cac ttc cgc ggc atg gtg ctg ctg 2232 
Glu Leu Ala Phe Gin Glu Gin Leu His Phe Arg Gly Met Val Leu Leu 
610 615 620 

ctg ctg ctg gec tac gtg ctg etc acc tac ate ctg ctg etc aac atg 2280 
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Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr He Leu Leu Leu Asn Met 
625 630 635 640 

etc ate gec etc atg age gag acc gtc aac agt gtc gee act gac age 2328 
Leu He Ala Leu Met Ser Glu Thr Val Asn Ser Val Ala Thr Asp Ser 
645 650 655 

tgg age ate tgg aag ctg cag aaa gec ate tct gtc ctg gag atg gag 2376 
Trp Ser He Trp Lys Leu Gin Lys Ala He Ser Val Leu Glu Met Glu 
660 665 670 

aat ggc tat tgg tgg tgc agg aag aag cag egg gca ggt gtg atg ctg 2424 
Asn Gly Tyr Trp Trp Cys Arg Lys Lys Gin Arg Ala Gly Val Met Leu 
675 680 685 

acc gtt ggc act aag cca gat ggc age ccg gat gag cgc tgg tgc ttc 2472 
Thr Val Gly Thr Lys Pro Asp Gly Ser Pro Asp Glu Arg Trp Cys Phe 
690 695 700 

agg gtg gag gag gtg aac tgg get tea tgg gag cag acg ctg cct acg 2520 
Arg Val Glu Glu Val Asn Trp Ala Ser Trp Glu Gin Thr Leu Pro Thr 
705 710 715 720 

ctg tgt gag gac ccg tea ggg gca ggt gtc cct cga act etc gag aac 2568 
Leu Cys Glu Asp Pro Ser Gly Ala Gly Val Pro Arg Thr Leu Glu Asn 
725 730 735 

cct gtc ctg get tec cct ccc aag gag gat gag gat ggt gec tct gag 2616 
Pro Val Leu Ala Ser Pro Pro Lys Glu Asp Glu Asp Gly Ala Ser Glu 
740 745 750 

gaa aac tat gtg ccc gtc cag etc etc cag tec aac tgatggccca 2662 
Glu Asn Tyr Val Pro Val Gin Leu Leu Gin Ser Asn 
755 760 

gatgeagcag gaggecagag gacagagcag aggatctttc caaccacatc tgctggctct 2722 

ggggtcccag tgaattctgg tggcaaatat atattttcac taactcaaaa aaaaaaaaaa 2782 

aaaaaaaaaa aaaaaaaaaa aaaaaaa 2809 



<210> 5 

<211> 764 

<212> PRT 

<213> Homo sapiens 

<400> 5 

Met Thr Ser Pro Ser Ser Ser Pro Val Phe Arg Leu Glu Thr Leu Asp 
15 10 15 

Gly Gly Gin Glu Asp Gly Ser Glu Ala Asp Arg Gly Lys Leu Asp Phe 
20 25 30 

Gly Ser Gly Leu Pro Pro Met Glu Ser Gin Phe Gin Gly Glu Asp Arg 
35 40 45 

Lys Phe Ala Pro Gin lie Arg Val Asn Leu Asn Tyr Arg Lys Gly Thr 
50 55 60 
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Gly Ala Ser Gin Pro Asp Pro Asn Arg Phe Asp Arg Asp Arg Leu Phe 
65 70 75 80 

Asn Ala Val Ser Arg Gly Val Pro Glu Asp Leu Ala Gly Leu Pro Glu 
85 90 95 

Tyr Leu Ser Lys Thr Ser Lys Tyr Leu Thr Asp Ser Glu Tyr Thr Glu 
100 105 110 

Gly Ser Thr Gly Lys Thr Cys Leu Met Lys Ala Val Leu Asn Leu Lys 
115 120 125 

Asp Gly Val Asn Ala Cys lie Leu Pro Leu Leu Gin lie Asp Arg Asp 
130 135 140 

Ser Gly Asn Pro Gin Pro Leu Val Asn Ala Gin Cys Thr Asp Asp Tyr 
145 150 155 160 

Tyr Arg Gly His Ser Ala Leu His He Ala He Glu Lys Arg Ser Leu 
165 170 175 

Gin Cys Val Lys Leu Leu Val Glu Asn Gly Ala Asn Val His Ala Arg 
180 185 190 • 

Ala Cys Gly Arg Phe Phe Gin Lys Gly Gin Gly Thr Cys Phe Tyr Phe 
195 200 205 

Gly Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp Asp Val 
210 215 220 

Val Ser Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu Gin Ala 
225 230 235 240 

Thr Asp Ser Gin Gly Asn Thr Val Leu His Ala Leu Val Met He Ser 
245 250 255 



Asp Asn Ser 



Leu Leu Gin 
275 

He Arg Asn 
290 

Gly Lys He 
305 

Leu Ser His 



Arg Val Ser 



Ser Val Leu 
355 



Ala Glu Asn He 
260 

Ala Gly Ala Arg 



Leu Gin Asp Leu 
295 

Glu He Phe Arg 
310 

Leu Ser Arg Lys 
325 

Leu Tyr Asp Leu 
340 

Glu He He Ala 



Ala Leu Val Thr Ser Met Tyr Asp Gly 
265 270 

Leu Cys Pro Thr Val Gin Leu Glu Asp 
280 285 

Thr Pro Leu Lys Leu Ala Ala Lys Glu 
300 

His He Leu Gin Arg Glu Phe Ser Gly 

315 320 

Phe Thr Glu Trp Cys Tyr Gly Pro Val 
330 335 

Ala Ser Val Asp Ser Cys Glu Qlu Asn 
345 350 

Phe His Cys Lys Ser Pro His Arg His 
360 365 
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Arg Met Val Val Leu Glu Pro Leu Asn Lys. Leu Leu Gin Ala Lys Trp 
370 375 380 

Asp Leu Leu He Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu He 
385 390 395 400 

Tyr Met Phe He Phe Thr Ala Val Ala Tyr His Gin Pro Thr Leu Lys 
405 4io 415 

Lys Gin Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu 
420 425 430 

Leu Thr Gly His He Leu He Leu Leu Gly Gly He Tyr Leu Leu Val 
435 440 445 

Gly Gin Leu Trp Tyr Phe Trp Arg Arg His Val Phe He Trp He Ser 
450 455 460 

Phe He Asp Ser Tyr Phe Glu He Leu Phe Leu Phe Gin Ala Leu Leu 
465 470 475 480 

Thr Val Val Ser Gin Val Leu Cys Phe Leu Ala He Glu Trp Tyr Leu 
485 490 495 

Pro Leu Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr 
500 505 510 

Tyr Thr Arg Gly Phe Gin His Thr Gly He Tyr Ser Val Met He Gin 
515 520 525 

Lys Val lie Leu Arg Asp Leu Leu Arg Phe Leu Leu He Tyr Leu Val 
530 535 540 

Phe Leu Phe Gly Phe Ala Val Ala Leu Val Ser Leu Ser Gin Glu Ala 
545 550 555 560 

Trp Arg Pro Glu Ala Pro Thr Gly Pro Asn Ala Thr Glu Ser Val Gin 
565 570 575 

Pro Met Glu Gly Gin Glu Asp Glu Gly Asn Gly Ala Gin Tyr Arg Gly 
580 585 590 

He Leu Glu Ala Ser Leu Glu Leu Phe Lys Phe Thr lie Gly Met Gly 
595 600 605 

Glu Leu Ala Phe Gin Glu Gin Leu His Phe Arg Gly Met Val Leu Leu 
610 615 620 

Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr lie Leu Leu Leu Asn Met 
625 630 635 6 40 

Leu He Ala Leu Met Ser Glu Thr Val Asn Ser Val Ala Thr Asp Ser 
645 650 655 

Trp Ser lie Trp Lys Leu Gin Lys Ala lie Ser Val Leu Glu Met Glu 
660 665 670 

Asn Gly Tyr Trp Trp Cys Arg Lys Lys Gin Arg Ala Gly Val Met Leu 
675 680 685 
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Thr Val Gly Thr Lys Pro Asp Gly Ser Pro Asp Glu Arg Trp Cys Phe 
690 695 700 

Arg Val Glu Glu Val Asn Trp Ala Ser Trp Glu Gin Thr Leu Pro Thr 
705 710 715 720 

Leu Cys Glu Asp Pro Ser Gly Ala Gly Val Pro Arg Thr Leu Glu Asn 
725 730 735 

Pro Val Leu Ala Ser Pro Pro Lys Glu Asp Glu Asp Gly Ala Ser Glu 
740 745 750 

Glu Asn Tyr Val Pro Val Gin Leu Leu Gin Ser Asn 
755 760 



<210> 


6 




<211> 


2292 




<212> 


DNA 




<213> 


Homo 
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<220> 






<221> 


CDS 




<222> 


(1).. 


(2292) 


<400> 


6 





atg acc tea ccc tec age tct cca gtt ttc agg ttg gag aca tta gat 48 
Met Thr Ser Pro Ser Ser Ser Pro Val Phe Arg Leu Glu Thr Leu Asp 
15 10 15 

gga ggc caa gaa gat ggc tct gag gcg gac aga gga aag ctg gat ttt 96 
Gly Gly Gin Glu Asp Gly Ser Glu Ala Asp Arg Gly Lys Leu Asp Phe 
20 25 30 



ggg age ggg ctg cct ccc atg gag tea cag ttc cag ggc gag gac egg 144 
Gly Ser Gly Leu Pro Pro Met Glu Ser Gin Phe Gin Gly Glu Asp Arg 
35 40 45 

aaa ttc gee cct cag ata aga gtc aac etc aac tac cga aag gga aca 192 
Lys Phe Ala Pro Gin lie Arg Val Asn Leu Asn Tyr Arg Lys Gly Thr 
50 55 60 

ggt gec agt cag ccg gat cca aac cga ttt gac cga gat egg etc ttc 240 
Gly Ala Ser Gin Pro Asp Pro Asn Arg Phe Asp Arg Asp Arg Leu Phe 
65 70 75 80 

aat gcg gtc tec egg ggt gtc ccc gag gat ctg get gga ctt cca gag 288 
Asn Ala Val Ser Arg Gly Val Pro Glu Asp Leu Ala Gly Leu Pro Glu 
85 90 95 

tac ctg age aag acc age aag tac etc acc gac teg gaa tac aca gag 336 
Tyr Leu Ser Lys Thr Ser Lys Tyr Leu Thr Asp Ser Glu Tyr Thr Glu 
100 105 110 

ggc tec aca ggt aag acg tgc ctg atg aag get gtg ctg aac ctt aag 384 
Gly Ser Thr Gly Lys Thr Cys Leu Met Lys Ala Val Leu Asn Leu Lys 
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115 120 125 

gac gga gtc aat gcc tgc att ctg cca ctg ctg cag ate gac agg gac 4 32 
Asp Gly Val Asn Ala Cys lie Leu Pro Leu Leu Gin lie Asp Arg Asp 
130 135 140 

tct ggc aat cct cag ccc ctg gta aat gcc cag tgc aca gat gac tat 480 
Ser Gly Asn Pro Gin Pro Leu Val Asn Ala Gin Cys Thr Asp Asp Tyr 
145 150 155 160 

tac cga ggc cac age get ctg cac ate gcc att gag aag agg agt ctg 528 
Tyr Arg Gly His Ser Ala Leu His lie Ala lie Glu Lys Arg Ser Leu 
165 170 175 

cag tgt gtg aag etc ctg gtg gag aat ggg gcc aat gtg cat gcc egg 576 
Gin Cys Val Lys Leu Leu Val Glu Asn Gly Ala Asn Val His Ala Arg 
180 185 190 

gcc tgc ggc cgc ttc ttc cag aag ggc caa ggg act tgc ttt tat ttc 624 
Ala Cys Gly Arg Phe Phe Gin Lys Gly Gin Gly Thr Cys Phe Tyr Phe 
195 200 205 

ggt gag eta ccc etc tct ttg gcc get tgc acc aag cag tgg gat gtg 672 
Gly Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp Asp Val 
210 215 220 

gta age tac etc ctg gag aac cca cac cag ccc gcc age ctg cag gcc 720 
Val Ser Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu Gin Ala 
225 230 235 240 

act gac tec cag ggc aac aca gtc ctg cat gcc eta gtg atg ate teg 768 
Thr Asp Ser Gin Gly Asn Thr Val Leu His Ala Leu Val Met lie Ser 
245 250 255 

gac aac tea get gag aac att gca ctg gtg acc age atg tat gat ggg 816 
Asp Asn Ser Ala Glu Asn lie Ala Leu Val Thr Ser Met Tyr Asp Gly 
260 265 270 



etc etc caa get ggg gcc cgc etc tgc cct acc gtg cag ctt gag gac 864 
Leu Leu Gin Ala Gly Ala Arg Leu Cys Pro Thr Val Gin Leu Glu Asp 
275 280 285 

ate cgc aac ctg cag gat etc acg cct ctg aag ctg gcc gcc aag gag 912 
lie Arg Asn Leu Gin Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu 
290 295 300 

ggc aag ate gag att ttc agg cac ate ctg cag egg gag ttt tea gga 960 
Gly Lys lie Glu lie Phe Arg His lie Leu Gin Arg Glu Phe Ser Gly 
305 310 315 320 

ctg age cac ctt tec cga aag ttc acc gag tgg tgc tat ggg cct gtc 1008 
Leu Ser His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val 
325 330 335 

egg gtg teg ctg tat gac ctg get tct gtg gac age tgt gag gag aac 1056 
Arg Val Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn 
340 345 350 
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tea gtg ctg gag ate att gec ttt cat tgc aag age'ecg cac cga cac 
Ser Val Leu Glu He II Ala Phe His Cys Lys Ser Pro His Arg His 
355 360 365 

cga atg gtc gtt ttg gag ccc ctg aac aaa ctg ctg cag gcg aaa tgg 
Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin Ala Lys Trp 
370 375 380 

gat ctg etc ate ccc aag ttc ttc tta aac ttc ctg tgt aat ctg ate 
Asp Leu Leu He Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu He 
385 390 395 400 

tac atg ttc ate ttc acc get gtt gee tac cat cag cct acc ctg aag 
Tyr Met Phe He Phe Thr Ala Val Ala Tyr His Gin Pro Thr Leu Lys 
405 410 415 

aag cag gec gee cct cac ctg aaa gcg gag gtt gga aac tec atg ctg 
Lys Gin Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu 
420 425 430 

ctg acg ggc cac ate ctt ate ctg eta ggg ggg ate tac etc etc gtg 
Leu Thr Gly His He Leu He Leu Leu Gly Gly He Tyr Leu Leu Val 
435 440 445 

ggc cag ctg tgg tac ttc tgg egg cgc cac gtg ttc ate tgg ate teg 
Gly Gin Leu Trp Tyr Phe Trp Arg Arg His Val Phe He Trp He Ser 
450 455 460 

ttc ata gac age tac ttt gaa ate etc ttc ctg ttc cag gec ctg etc 
Phe He Asp Ser Tyr Phe Glu lie Leu Phe Leu Phe Gin Ala Leu Leu 
465 470 475 480 



aca gtg gtg tec cag gtg ctg tgt ttc ctg gec ate gag tgg tac ctg 
Thr Val Val Ser Gin Val Leu Cys Phe Leu Ala He Glu Trp Tyr Leu 
485 490 495 

ccc ctg ctt gtg tct gcg ctg gtg ctg ggc tgg ctg aac ctg ctt tac 
Pro Leu Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr 
500 505 510 

tat aca cgt ggc ttc cag cac aca ggc ate tac agt gtc atg ate cag 
Tyr Thr Arg Gly Phe Gin His Thr Gly He Tyr Ser Val Met He Gin 
515 520 525 

aag gtc ate ctg egg gac ctg ctg cgc ttc ctt ctg ate tac tta gtc 
Lys Val He Leu Arg Asp Leu Leu Arg Phe Leu Leu He Tyr Leu Val 
530 535 540 

ttc ctt ttc ggc ttc get gta gec ctg gtg age ctg age cag gag get 
Phe Leu Phe Gly Phe Ala Val Ala Leu Val Ser Leu Ser Gin Glu Ala 
545 550 555 560 

tgg cgc ccc gaa get cct aca ggc ccc aat gee aca gag tea gtg cag 
Trp Arg Pro Glu Ala Pro Thr Gly Pro Asn Ala Thr Glu Ser Val Gin 
565 570 575 

ccc atg gag gga cag gag gac gag ggc aac ggg gec cag tac agg ggt 
Pro Met Glu Gly Gin Glu Asp Glu Gly Asn Gly Ala Gin Tyr Arg Gly 



1104 
1152 
1200 
1248 
1296 
1344 
1392 
1440 

1488 
1536 
1584 
1632 
1680 
1728 
1776 
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580 585 590 

ate ctg gaa gec tec ttg gag etc ttc aaa ttc acc ate ggc atg ggc 1824 
He Leu Glu Ala Ser Leu Glu Leu Phe Lys Phe Thr lie Gly Met Glv 
"5 600 605 

gag ctg gee ttc cag gag cag ctg cac ttc cgc ggc atg gtg ctg eta 1872 
Glu Leu Ala Phe Gin Glu Gin Leu His Phe Arg Gly Met Val Leu Leu 
610 615 620 

ctg ctg ctg gec tac gtg ctg etc acc tac ate ctg ctg etc aac atg 1920 
Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr He Leu Leu Leu Asn Met 
625 «0 6 35 64Q 

etc ate gee etc atg age gag acc gtc aac agt gtc gee act gac age 1968 
Leu lie Ala Leu Met Ser Glu Thr Val Asn Ser Val Ala Thr Asp Ser 
645 650 655 

tgg age ate tgg aag ctg cag aaa gee ate tct gtc ctg gag atg gag 2016 
Trp Ser lie Trp Lys Leu Gin Lys Ala lie Ser Val Leu Glu Met Glu 
660 665 670 

aat ggc tat tgg tgg tgc agg aag aag cag egg gca ggt gtg atg ctg 2064 
Asn Gly Tyr Trp Trp Cys Arg Lys Lys Gin Arg Ala Gly Val Met Leu 
67 5 680 685 

acc gtt ggc act aag cca gat ggc age ccg gat gag cgc tgg tgc ttc 2112 
Thr Val Gly Thr Lys Pro Asp Gly Ser Pro Asp Glu Arg Trp Cvs Phe 
690 695 700 



gaa aac tat gtg ccc gtc cag etc etc cag tec aac 
Glu Asn Tyr Val Pro Val Gin Leu Leu Gin Ser Asn 
755 760 



2160 



2208 



agg gtg gag gag gtg aac tgg get tea tgg gag cag acg ctg cct acg 
Arg Val Glu Glu Val Asn Trp Ala Ser Trp Glu Gin Thr Leu Pro Thr 
705 710 715 720 

ctg tgt gag gac ccg tea ggg gca ggt gtc cct cga act etc gag aac 
Leu Cys Glu Asp Pro Ser Gly Ala Gly Val Pro Arg Thr Leu Glu Asn 
725 730 735 

cct gtc ctg get tec cct ccc aag gag gat gag gat ggt gee tct gag 2256 
Pro Val Leu Ala Ser Pro Pro Lys Glu Asp Glu Asp Gly Ala Ser Glu 
740 745 750 



2292 
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47 



gc ggc cgc ttc ttc cag aag ggc caa ggg act tgc ttt tat ttc ggt 
Gly Arg Phe Phe Gin Lys Gly Gin Gly Thr Cys Phe Tyr Phe Gly 
1 5 10 15 

gag eta ccc etc tct ttg gec get tgc acc aag cag tgg gat gtg gta 
Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp Asp Val Val 
20 25 30 

age tac etc ctg gag aac cca cac cag ccc gee age ctg cag gee act 
Ser Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu Gin Ala Thr 
35 40 45 

gac tec cag ggc aac aca gtc ctg cat gec eta gtg atg ate teg gac 191 
Asp Ser Gin Gly Asn Thr Val Leu His Ala Leu Val Met lie Ser Asp 
50 55 60 



aac tea get gag aac att gca ctg gtg acc age atg tat gat ggg etc 
Asn Ser Ala Glu Asn lie Ala Leu Val Thr Ser Met Tyr Asp Gly Leu 
65 70 75 

etc caa get ggg gee cgc etc tgc cct acc gtg cag ctt gag gac ate 
Leu Gin Ala Gly Ala Arg Leu Cys Pro Thr Val Gin Leu Glu Asp He 
80 85 90 95 

cgc aac ctg cag gat etc acg cct ctg aag ctg gee gee aag gag ggc 
Arg Asn Leu Gin Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu Gly 
100 105 110 



gtg teg ctg tat gac ctg get tct gtg gac age tgt gag gag aac tea 

Val Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn Ser 

145 150 155 

gtg ctg gag ate att gee ttt cat tgc aag age ccg cac cga cac cga 

Val Leu Glu He He Ala Phe His Cys Lys Ser Pro His Arg His Arg 

160 165 170 175 



95 



143 



239 



287 



335 



383' 



aag ate gag att ttc agg cac ate ctg cag egg gag ttt tea gga ctg 
Lys He Glu lie Phe Arg His He Leu Gin Arg Glu Phe Ser Gly Leu 
115 120 125 

age cac ctt tec cga aag ttc acc gag tgg tgc tat ggg cct gtc egg 431 
Ser His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val Arg 
130 135 140 



479 



527 



atg gtc gtt ttg gag ccc ctg aac aaa ctg ctg cag gcg aaa tgg gat 575 
Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin Ala Lys Trp Asp 
180 185 190 

ctg etc ate ccc aag ttc ttc tta aac ttc ctg tgt aat ctg ate tac 623 
Leu Leu He Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu He Tyr 
195 200 205 

atg ttc ate ttc acc get gtt gee tac cat cag cct acc ctg aag aag 671 
Met Phe He Phe Thr Ala Val Ala Tyr His Gin Pro Thr Leu Lys Lys 
210 215 220 

cag gee gec cct cac ctg aaa gcg gag gtt gga aac tec atg ctg ctg 719 
Gin Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu Leu 
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235 



acg ggc cac ate ctt ate ctg eta ggg ggg ate tac etc ri-o 

Tj, aj u . Ile u, u Le» u. a g ,s g «: a ?s a 767 

250 255 
cag ctg tgg tac ttc tag cqq cac cac ata *. 

«. >. Tyr ». t?i ^ a s" s? £ it: s a a £ 615 

260 265 270 

ata gac age tac ttt gaa ate etc tte rtn . 

lie As P Ser , v P„e L. a. l2 £ S ST 3 S 2 £ £ 

280 285 

s? a a a a $ a a a a a a a a a •» 

295 300 

ctg ctt gtg tct gcg ctg gtg ctg ggc tgg ctg aac eta ctt t*r. «.... 
Leu Leu V.1 Ser Ala Leu Val Leu G ? y Trp LeJ Asn Lei HI £ £ 

310 325 

aca cgt ggc ttc cag cac aca ggc ate tac agt ate at« 

Th ; Arg Gly Phe « a Hi S Thr Gly Ile Tyr ^ £ 2? s; G ca n 9 ^ 

325 330 



959 



1007 



a a a a a a a a a a a s a a a a 1055 
"i a a a a is a a a a is a a a a a 1103 

360 



ggc age ccg gat gag cgc tgg tgc ttc agg gtg gag gag gtq aac taa 
Gly Ser Pre Asp Glu Arg Trp Cys Phe Arg Vl U 111 tl Asn Trp 



1151 



380 



a a a a a a a a s a a a a a a a "** 

390 395 J 



1247 



gca ggt gtc cct cga act etc gag aac cct ate ctn nr >+ +~ 
f * Gly Val Pro Arg Thr Leu 11 As' £ S? S S £ Pro Pro 

405 4io 415 

f« r? 9 ? 3t ?f 9 9at 99t 9CC tCt ga 9 9 aa aac tat gtg ccc gtc cao 12« 
Lys Glu Asp Glu Asp Gly Ala Ser Glu Glu Asn Tyr Va? Pro fal "n 

420 4 25 430 

L^S Leu Ser' A^n t9at " CCCa 93t9Ca9Ca9 9 *" CCa ^ 9-agagcag 1350 
435 

aggatctttc eaaccacatc tgctggetct ggggtcccag tgaattctgg tggcaaatat 1410 
atattttcac taactcaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg 1470 
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agcggacgcg tgggtcgac 



<210> 8 
<211> 436 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Gly Arg Phe Phe Gin Lys Gly Gin Gly Thr Cys Phe Tyr Phe Gly Glu 
15 10 15 

Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp Asp Val Val Ser 
20 25 30 

Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu Gin Ala Thr Asp 
35 40 45 

Ser Gin Gly Asn Thr Val Leu His Ala Leu Val Met lie Ser Asp Asn 
50 55 60 

Ser Ala Glu Asn lie Ala Leu Val Thr Ser Met Tyr Asp Gly Leu Leu 
65 70 75 80 



Gin Ala Gly Ala Arg Leu Cys Pro Thr Val Gin Leu Glu Asp lie Arg 
85 90 95 

Asn Leu Gin Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu Gly Lys 
100 105 110 

lie Glu lie Phe Arg His He Leu Gin Arg Glu Phe Ser Gly Leu Ser 
115 120 125 

His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val Arg Val 
130 135 140 

Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn Ser Val 
145 150 155 160 

Leu Glu He He Ala Phe His Cys Lys Ser Pro His Arg His Arg Met 
165 170 175 

Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin Ala Lys Trp Asp Leu 
180 185 190 

Leu He Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu lie Tyr Met 
195 200 205 

Phe lie Phe Thr Ala Val Ala Tyr His Gin Pro Thr Leu Lys Lys Gin 
210 215 220 



Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu Leu Thr 

225 230 235 240 

Gly His lie Leu lie Leu Leu Gly Gly lie Tyr Leu Leu Val Gly Gin 

245 250 255 

Leu Trp Tyr Phe Trp Arg Arg His Val Phe He Trp lie Ser Phe He 
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260 265 270 

Asp Ser Tyr Phe Glu lie Leu Phe Leu Phe Gin Ala Leu Leu Thr Val 
275 280 285 

Val Ser Gin Val Leu Cys Phe Leu Ala lie Glu Trp Tyr Leu Pro Leu 
290 295 300 

Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr Tyr Thr 
305 310 315 320 

Arg Gly Phe Gin His Thr Gly He Tyr Ser Val Met He Gin Lys Lys 
325 330 335 

Ala He Ser Val Leu Glu Met Glu Asn Gly Tyr Trp Trp Cys Arg Lys 
340 345 350 

Lys Gin Arg Ala Gly Val Met Leu Thr Val * Gly Thr Lys Pro Asp Gly 
355 360 365 

Ser Pro Asp Glu Arg Trp Cys Phe Arg Val Glu Glu Val Asn Trp Ala 
370 375 380 

Ser Trp Glu Gin Thr Leu Pro Thr Leu Cys Glu Asp Pro Ser Gly Ala 
385 390 395 400 

Gly Val Pro Arg Thr Leu Glu Asn Pro Val Leu Ala Ser Pro Pro Lys 
405 410 415 

Glu Asp Glu Asp Gly Ala Ser Glu Glu Asn Tyr Val Pro Val Gin Leu 
420 425 430 

Leu Gin Ser Asn 
435 



<210> 9 
<211> 1308 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (1308) 

<400> 9 

ggc cgc ttc ttc cag aag ggc caa ggg act tgc ttt tat ttc ggt gag 48 

Gly Arg Phe Phe Gin Lys Gly Gin Gly Thr Cys Phe Tyr Phe Gly Glu 
15 10 15 



eta ccc etc tct ttg gec get tgc ace aag cag tgg gat gtg gta age 96 
Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp Asp Val Val Ser 
20 25 30 

tac etc ctg gag aac cca cac cag ccc gee age ctg cag gec act gac 14 4 
Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu Gin Ala Thr Asp 
35 40 45 
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tec cag ggc aac aca gtc ctg cat gec eta gtg atg'atc teg gac aac 192 
Ser Gin Gly Asn Thr Val Leu His Ala Leu Val Met lie Ser Asp Asn 
. 50 55 60 

tea get gag aac att gca ctg gtg ace age atg tat gat ggg etc etc 240 
Ser Ala Glu Asn lie Ala Leu Val Thr Ser Met Tyr Asp Gly Leu Leu 
65 70 75 80 

caa get ggg gec cgc etc tgc cct ace gtg cag ctt gag gac ate cgc 288 
Gin Ala Gly Ala Arg Leu Cys Pro Thr Val Gin Leu Glu Asp He Arg 
85 90 95 

aac ctg cag gat etc acg cct ctg aag ctg gec gec aag gag ggc aag 336 
Asn Leu Gin Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu Gly Lys 
100 105 110 

ate gag att ttc agg cac ate ctg cag egg gag ttt tea gga ctg age 384 
He Glu He Phe Arg His He Leu Gin Arg Glu Phe Ser Gly Leu Ser 
115 120 125 

cac ctt tec cga aag ttc acc gag tgg tgc tat ggg cct gtc egg gtg 432 
His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val Arg Val 
130 135 140 

teg ctg tat gac ctg get tct gtg gac age tgt gag gag aac tea gtg 480 
Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn Ser Val 
145 150 155 160 

ctg gag ate att gec ttt cat tgc aag age ccg cac cga cac cga atg 528 
Leu Glu He lie Ala Phe His Cys Lys Ser Pro His Arg His Arg Met 
165 170 175 

gtc gtt ttg gag ccc ctg aac aaa ctg ctg cag gcg aaa tgg gat ctg 576 
Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin Ala Lys Trp Asp Leu 
180 185 190 

etc ate ccc aag ttc ttc tta aac ttc ctg tgt aat ctg ate tac atg 624 
Leu He Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu He Tyr Met 
195 200 205 

ttc ate ttc acc get gtt gec tac cat cag cct acc ctg aag aag cag 672 
Phe He Phe Thr Ala Val Ala Tyr His Gin Pro Thr Leu Lys Lys Gin 
210 215 220 

gee gee cct cac ctg aaa gcg gag gtt gga aac tec atg ctg ctg acg 720 
Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu Leu Thr 
225 230 235 240 

ggc cac ate ctt ate ctg eta ggg ggg ate tac etc etc gtg ggc cag 768 
Gly His He Leu He Leu Leu Gly Gly He Tyr Leu Leu Val Gly Gin 
245 250 255 



ctg tgg tac ttc tgg egg cgc cac 
Leu Trp Tyr Phe Trp Arg Arg His 
260 

gac age tac ttt gaa ate etc ttc 
Asp Ser Tyr Phe Glu He Leu Phe 



gtg ttc ate tgg ate teg ttc ata 816 
Val Phe He Trp He Ser Phe He 
265 270 

ctg ttc cag gee ctg etc aca gtg 864 
Leu Phe Gin Ala Leu Leu Thr Val 
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275 280 



265 



912 



960 



1008 



1056 



1104 



£? Ill r?n S? f t9 r 9t Iu C Ct9 9CC atC gag tgg ta = ctg ccc ctg 
Ser Gin Val Leu Cys Phe Leu Ala lie Glu Trp Tyr Leu Pro Leu 

290 295 300 

fl» v.? t Ct f t9 f ? Ct9 990 tg9 Ctg aac ct 9 ctt tac tat aca 
Leu Val Ser Ala Leu Val Leu Gly Trp Leu Aan Leu Leu Tyr Tyr Thr 

305 310 315 320 

cgt ggc ttc cag cac aca ggc ate tac agt gtc atg ate caa aaa a« 
Arg Gly Phe Gin His Thr Gly lie Tyr sir ?al Me? lie Gin Lys Lys 
325 330 335 

gec ate tct gtc ctg gag atg gag aat ggc tat tgg tgg tgc agg aag 
Ala lie Ser Val Leu Glu Met Glu Asn Gly Tyr Trp Trp Cys Arg Lys 
340 345 350 

aag cag egg gca ggt gtg atg ctg acc gtt ggc act aag cca gat ggc 
Lys Gin Arg Ala Gly Val Met Leu Thr Val Gly Thr Lys Pro Asp Gly 
355 360 365 Y 

age ccg gat gag cgc tgg tgc ttc agg gtg gag gag gt g aac tgg get 1152 
Ser Pro Asp Glu Arg Trp Cys Phe Arg Val Glu Glu Val Asn Trp Ala 
370 375 380 

tea tgg gag cag acg ctg cct acg ctg tgt gag gac ccg tea aaa aca 
Ser Trp Glu Gin Thr Leu Pro Thr Leu Cys 111 Lp Pro Ser Gly 111 
385 390 395 400 

fil ?J D Ct n 93 ^ f tC 9 f 9 3aC CCt 9tC Ctg * ct tcc cct ccc aag 
Gly Val Pro Arg Thr Leu Glu Asn Pro Val Leu Ala Ser Pro Pro Lys 

405 410 415 

gag gat gag gat ggt gee tct gag gaa aac tat gtg ccc gtc cag etc 
Glu Asp Glu Asp Gly Ala Ser Glu Glu Asn Tyr Val Pro Val Gin Leu 
420 425 430 

etc cag tcc aac 
Leu Gin Ser Asn 
435 



1200 



1248 



1296 



1308 



<210> 10 
<211> 1794 
<212> DNA 
<213> Rattus sp. 
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<220> 

<221> CDS 

<222> (2) . . (1663) 

<400> 10 

g teg acc cac gcg tec get ctt tct ctg get gcg tgc ace aag cag tgg 49 

Ser Thr His Ala Ser Ala Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp 
15 10 15 

gat gtg gtg acc tac etc ctg gag aac cca cac cag ccg gee age ctg 97 
Asp Val Val Thr Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu 
20 25 30 

gag gec acc gac tec ctg ggc aac aca gtc ctg cat get ctg gta atg 145 
Glu Ala Thr Asp Ser Leu Gly Asn Thr Val Leu His Ala Leu Val Met 
35 40 45 

att gca gat aac teg cct gag aac agt gee ctg gtg ate cac atg tac 193 
He Ala Asp Asn Ser Pro Glu Asn Ser Ala Leu Val He His Met Tyr 
50 55 60 

gac ggg ctt eta caa atg ggg gcg cgc etc tgc ccc act gtg cag ctt 241 
Asp Gly Leu Leu Gin Met Gly Ala Arg Leu Cys Pro Thr Val Gin Leu 
65 70 75 80 

gag gaa ate tec aac cac caa ggc etc aca ccc ctg aaa eta gee gec 289 
Glu Glu He Ser Asn His Gin Gly Leu Thr Pro Leu Lys Leu Ala Ala 
85 90 95 

aag gaa ggc aaa ate gag att ttc agg cac att ctg cag egg gaa ttc 337 
Lys Glu Gly Lys He Glu He Phe Arg His He Leu Gin Arg Glu Phe 
100 105 110 

tea gga ccg tac cag ccc ctt tec cga aag ttt act gag tgg tgt tac 385 
Ser Gly Pro Tyr Gin Pro Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr 
115 120 125 

ggt cct gtg egg gta teg ctg tac gac ctg tec tct gtg gac age tgg 433 
Gly Pro Val Arg Val Ser Leu Tyr Asp Leu Ser Ser Val Asp Ser Trp 
130 135 140 

gaa aag aac teg gtg ctg gag ate ate get ttt cat tgc aag age ccg 481 
Glu Lys Asn Ser Val Leu Glu He He Ala Phe His Cys Lys Ser Pro 
145 150 155 160 

aac egg cac cgc atg gtg gtt tta gaa cca ctg aac aag ctt ctg cag 529 
Asn Arg His Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin 
165 170 175 

gag aaa tgg gat egg etc gtc tea aga ttc ttc ttc aac ttc gee tgc 577 
Glu Lys Trp Asp Arg Leu Val Ser Arg Phe Phe Phe Asn Phe Ala Cys 
180 185 190 

tac ttg gtc tac atg ttc ate ttc acc gtc gtt gec tac cac cag cct 625 
Tyr Leu Val Tyr Met Phe He Phe Thr Val Val Ala Tyr His Gin Pro 
195 200 205 

tec ctg gat cag cca gee ate ccc tea tea aaa gcg act ttt ggg gaa 673 
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Ser Leu Asp Gin Pro Ala lie Pro Ser Ser Lys Ala Thr Phe Gly Glu 
210 215 220 

tec atg ctg ctg ctg ggc cac att ctg ate ctg ctt ggg ggt att tac 721 
Ser Met Leu Leu Leu Gly His lie Leu lie Leu Leu Gly Gly lie Tyr 
225 230 235 240 

etc tta ctg ggc cag ctg tgg tac ttt tgg egg egg cgc ctg ttt ate 769 
Leu Leu Leu Gly Gin Leu Tip Tyr Phe Trp Arg Arg Arg Leu Phe lie 
245 250 255 

tgg ate tea ttc atg gac age tac ttt gaa ate etc ttt etc ctt cag 817 
Trp lie Ser Phe Met Asp Ser Tyr Phe Glu lie Leu Phe Leu Leu Gin 
260 265 270 

get ctg etc aca gtg ctg tec cag gtg ctg cgc ttc atg gag act gaa 865 
Ala Leu Leu Thr Val Leu Ser Gin Val Leu Arg Phe Met Glu Thr Glu 
275 280 285 

tgg tac eta ccc ctg eta gtg tta tec eta gtg ctg ggc tgg ctg aac 913 
Trp Tyr Leu Pro Leu Leu Val Leu Ser Leu Val Leu Gly Trp Leu Asn 
290 295 300 



ctg ctt tac tac aca egg ggc ttt cag cac aca ggc ate tac agt gtc 961 
Leu Leu Tyr Tyr Thr Arg Gly Phe Gin His Thr Gly lie Tyr Ser Val 
305 310 315 320 

atg ate cag aag gtc ate ctt cga gac ctg etc cgt ttc ctg ctg gtc 1009 
Met lie Gin Lys Val lie Leu Arg Asp Leu Leu Arg Phe Leu Leu Val 
325 330 335 

tac ctg gtc ttc ctt ttc ggc ttt get gta gec eta gta age ttg age 1057 
Tyr Leu Val Phe Leu Phe Gly Phe Ala Val Ala Leu Val Ser Leu Ser 
340 345 350 

aga gag gee cga agt ccc aaa gee cct gaa gat aac aac tec aca gtg 1105 
Arg Glu Ala Arg Ser Pro Lys Ala Pro Glu Asp Asn Asn Ser Thr Val 
355 360 365 

acg gaa cag ccc acg gtg ggc cag gag gag gag cca get cca tat egg 1153 
Thr Glu Gin Pro Thr Val Gly Gin Glu Glu Glu Pro Ala Pro Tyr Arg 
370 375 380 

age att ctg gat gee tec eta gag ctg ttc aag ttc acc att ggt atg 1201 
Ser He Leu Asp Ala Ser Leu Glu Leu Phe Lys Phe Thr He Gly Met 
385 390 395 400 

ggg gag ctg get ttc cag gaa cag ctg cgt ttt cgt ggg gtg gtc ctg 124 9 
Gly Glu Leu Ala Phe Gin Glu Gin Leu Arg Phe Arg Gly Val Val Leu 
405 410 415 

ctg ttg ctg ttg gec tac gtc ctt etc acc tac gtc ctg ctg etc aac 1297 
Leu Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr Val Leu Leu Leu Asn 
420 425 430 

atg etc att get etc atg age gaa act gtc aac cac gtt get gac aac 1345 
Met Leu He Ala Leu Met Ser Glu Thr Val Asn His Val Ala Asp Asn 
435 440 445 
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age tgg age ate tgg aag ttg cag aaa gec ate tct gtc ttg gag atg 1393 

Ser Trp Ser lie Trp Lys Leu Gin Lys Ala II S r Val Leu Glu M t 

450 455 460 

gag aat ggt tac tgg tgg tgc egg agg aag aaa cat cgt gaa ggg agg 1441 

Glu Asn Gly Tyr Trp Trp Cys Arg Arg Lys Lys His Arg Glu Gly Arg 
465 470 475 480 

ctg ctg aaa gtc ggc acc agg ggg gat ggt acc cct gat gag cgc tgg 1489 

Leu Leu Lys Val Gly Thr Arg Gly Asp Gly Thr Pro Asp Glu Arg Trp 
485 490 495 

tgc ttc agg gtg gag gaa gta aat tgg get get tgg gag aag act ctt 1537 

Cys Phe Arg Val Glu Glu Val Asn Trp Ala Ala Trp Glu Lys Thr Leu 
500 505 510 

ccc acc tta tct gag gat cca tea ggg cca ggc ate act ggt aat aaa 1585 

Pro Thr Leu Ser Glu Asp Pro Ser Gly Pro Gly lie Thr Gly Asn Lys 
515 520 525 



aag aac cca acc tct aaa ccg ggg aag aac agt gee tea gag gaa gac 1633 
Lys Asn Pro Thr Ser Lys Pro Gly Lys Asn Ser Ala Ser Glu Glu Asp 
530 535 540 

cat ctg ccc ctt cag gtc etc cag tec ccc tgatggccca gatgeagcag 1683 
His Leu Pro Leu Gin Val Leu Gin Ser Pro 
545 550 

caggctggca ggatggagta gggaatcttc ccagccacac cagaggctac tgaattttgg 174 3 
tggaaatata aatatttttt ttgcataaaa aaaaaaaaaa agggeggecg c 1794 



<210> 11 

<211> 554 

<212> PRT 

<213> Rattus sp. 

<400> 11 

Ser Thr His Ala Ser Ala Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp 
15 10 15 

Asp Val Val Thr Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu 
20 25 30 

Glu Ala Thr Asp Ser Leu Gly Asn Thr Val Leu His Ala Leu Val Met 
35 40 45 

lie Ala Asp Asn Ser Pro Glu Asn Ser Ala Leu Val lie His Met Tyr 
50 55 60 

Asp Gly Leu Leu Gin Met Gly Ala Arg Leu Cys Pro Thr Val Gin Leu 
65 70 75 80 

Glu Glu lie Ser Asn His Gin Gly Leu Thr Pro Leu Lys Leu Ala Ala 
85 90 95 
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Lys Glu Gly Lys He Glu He Phe Arg His He Leu Gin Arg Glu Phe 
100 105 110 

Ser Gly Pro Tyr Gin Pro Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr 
115 120 125 

Gly Pro Val Arg Val Ser Leu Tyr Asp Leu Ser Ser Val Asp Ser Trp 
130 135 140 

Glu Lys Asn Ser Val Leu Glu He He Ala Phe His Cys Lys Ser Pro 
145 150 155 160 

Asn Arg His Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin 
165 170 175 



Glu Lys Trp Asp Arg Leu Val Ser Arg Phe Phe Phe Asn Phe Ala Cys 
180 185 190 

Tyr Leu Va 1 Tyr Met Phe He Phe Thr Val Val Ala Tyr His Gin Pro 
195 200 205 

Ser Leu Asp Gin Pro Ala He Pro Ser Ser Lys Ala Thr Phe Gly Glu 
210 215 220 

Ser Met Leu Leu Leu Gly His lie Leu lie Leu Leu Gly Gly lie Tyr 
225 230 235 240 

Leu Leu Leu Gly Gin Leu Trp Tyr Phe Trp Arg Arg Arg Leu Phe lie 
245 250 255 

Trp He Ser Phe Met Asp Ser Tyr Phe Glu lie Leu Phe Leu Leu Gin 
260 265 270 

Ala Leu Leu Thr Val Leu Ser Gin Val Leu Arg Phe Met Glu Thr Glu 
275 280 285 

Trp Tyr Leu Pro Leu Leu Val Leu Ser Leu Val Leu Gly Trp Leu Asn 
290 295 300 

Leu Leu Tyr Tyr Thr Arg Gly Phe Gin His Thr Gly lie Tyr Ser Val 
305 310 315 320 

Met He Gin Lys Val lie Leu Arg Asp Leu Leu Arg Phe Leu Leu Val 
325 330 335 

Tyr Leu Val Phe Leu Phe Gly Phe Ala Val Ala Leu Val Ser Leu Ser 
340 345 350 

Arg Glu Ala Arg Ser Pro Lys Ala Pro Glu Asp Asn Asn Ser Thr Val 
355 360 365 

Thr Glu Gin Pro Thr Val Gly Gin Glu Glu Glu Pro Ala Pro Tyr Arg 
370 375 380 

Ser lie Leu Asp Ala Ser Leu Glu Leu Phe Lys Phe Thr lie Gly Met 
385 390 395 400 

Gly Glu Leu Ala Phe Gin Glu Gin Leu Arg Phe Arg Gly Val Val Leu 
405 410 415 
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Leu Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr Val Leu Leu Leu Asn 
420 425 430 

Met Leu lie Ala Leu Met Ser Glu Thr Val Asn His Val Ala Asp Asn 
435 440 445 

Ser Trp Ser lie Trp Lys Leu Gin Lys Ala He Ser Val Leu Glu Met 
450 455 460 

Glu Asn Gly Tyr Trp Trp Cys Arg Arg Lys Lys His Arg Glu Gly Arg 
465 470 475 480 

Leu Leu Lys Val Gly Thr Arg Gly Asp Gly Thr Pro Asp Glu Arg Trp 
485 490 495 

Cys Phe Arg Val Glu Glu Val Asn Trp Ala Ala Trp Glu Lys Thr Leu 
500 505 510 

Pro Thr Leu Ser Glu Asp Pro Ser Gly Pro Gly He Thr Gly Asn Lys 
515 520 525 

Lys Asn Pro Thr Ser Lys Pro Gly Lys Asn Ser Ala Ser Glu Glu Asp 
530 535 540 



His Leu Pro Leu Gin Val Leu Gin Ser Pro 
545 550 



<210> 12 
<211> 1662 
<212> DNA 
<213> Rattus sp. 

<220> 

<221> CDS 

<222> (1) . . (1662) 

<400> 12 

teg acc cac gcg tec get ctt tct ctg 
Ser Thr His Ala Ser Ala Leu Ser Leu 
1 5 

gat gtg gtg acc tac etc ctg gag aac 
Asp Val Val Thr Tyr Leu Leu Glu Asn 
20 25 

gag gec acc gac tec ctg ggc aac aca 
Glu Ala Thr Asp Ser Leu Gly Asn Thr 
35 40 

att gca gat aac teg cct gag aac agt 
He Ala Asp Asn Ser Pro Glu Asn Ser 
50 55 



get gcg tgc acc aag cag tgg 48 
Ala Ala Cys Thr Lys Gin Trp 
10 15 

cca cac cag ccg gee age ctg 96 
Pro His Gin Pro Ala Ser Leu 
30 

gtc ctg cat get ctg gta atg 144 
Val Leu His Ala Leu Val Met 
45 

gee ctg gtg ate cac atg tac 192 
Ala Leu Val He His Met Tyr 
60 



gac ggg ctt eta caa atg ggg gcg cgc etc tgc ccc act gtg cag ctt 240 
Asp Gly Leu Leu Gin Met Gly Ala Arg Leu Cys Pro Thr Val Gin Leu 
65 70 75 80 
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gag gaa ate tec aac cac caa qoc etc aca 

«. «. .U ser *s„ H is «. 2 2 £ £ «• £ £ 2 E ,M 

aag gaa ggc aaa ate gag att ttc aoa cac ^ 

LVS » B ay g. „. i» „. J» ~ J" «J «, c,o g. ttc 336 



110 



tea gga ccg tac cag ccc ctt tec cga aag ttt act a*n 
Ser Gly £ Tyr Gln Pro Leu ^ ly J »t act gag tgg tgt tac 

120 125 

ggt cct gtg egg gta teg ctg tac gac ctg tec tct a tn 
«y JJ. v al Srg „ S er u» T yr J, 2 £ £ ?.? £ 2 5 

XJt/ 135 140 

gaa aag aac teg gtg ctg gag ate ate act tt+ 

«. As„ s« « 2 SlJ „ e 2 2 2 2 2 5! £ 180 

130 155 



384 



432 



160 



aac egg cac cgc atg gtg qtt tta aaa r<-» r*+~ 

*- „.? 2 ? al u. 2 2 2 2 2 2 2 2 528 

163 270 



175 



gag aaa tgg gat egg etc gtc tea aga ttc ttc ttr 

«. T tP ^ L . u val s „ & £ s 2 2 2 2 2 

185 19Q 

tac ttg gtc tac atg ttc ate ttc acc qtc ott ocr +*r> 

Ty, «. vj, TyI „ K Phe „. p Ihr *« J2 J» «= «o cjj cct «, 

200 205 

tec ctg gat cag cca gee ate ccc tea tea aaa qco act ttt „„„ 

Ser Leu Asp G m Pro Ala Ue Pro Ser Ser ^ £? £ J£ Gly ITu ™ 

220 

2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 720 

235 240 

2 2 2 2 2 2 g 2 2 2 2 2 2 2 2 2 " 8 

245 "0 255 

tgg ate tea ttc atg gac age tac ttt oaa af - ^ 

^ ue s„ £ L P S L Iy < 2 2 2 2 2 2 2 2 816 

265 270 

2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 864 

280 285 

5 2 2 2 2 2 2 2 2 2 « 2 2 2 2 2 812 

295 300 

2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 960 
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atg ate cag aag gtc ate ctt cga gac ctg etc cgt ttc ctg ctg gtc 1008 
Met He Gin Lys Val He Leu Arg Asp Leu Leu Arg Phe Leu Leu Val 
325 330 335 

tac ctg gtc ttc ctt ttc ggc ttt get gta gec eta gta age ttg age 1056 
Tyr Leu Val Phe Leu Phe Gly Phe Ala Val Ala Leu Val Ser Leu Ser 
340 345 350 



aga gag gec cga agt ccc aaa gec cct gaa gat aac aac tec aca gtg 
Arg Glu Ala Arg Ser Pro Lys Ala Pro Glu Asp Asn Asn Ser Thr Val 
355 360 365 



1104 



acg gaa cag ccc acg gtg ggc cag gag gag gag cca get cca tat egg 1152 
Thr Glu Gin Pro Thr Val Gly Gin Glu Glu Glu Pro Ala Pro Tyr Arg 
370 375 380 

age att ctg gat gee tec eta gag ctg ttc aag ttc ace att ggt atg 1200 
Ser He Leu Asp Ala Ser Leu Glu Leu Phe Lys Phe Thr He Gly Met 
385 390 395 400 

ggg gag ctg get ttc cag gaa cag ctg cgt ttt cgt ggg gtg gtc ctg 1248 
Gly Glu Leu Ala Phe Gin Glu Gin Leu Arg Phe Arg Gly Val Val Leu 
405 410 415 

ctg ttg ctg ttg gec tac gtc ctt etc acc tac gtc ctg ctg etc aac 1296 
Leu Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr Val Leu Leu Leu Asn 
420 425 430 

atg etc att get etc atg age gaa act gtc aac cac gtt get gac aac 1344 
Met Leu He Ala Leu Met Ser Glu Thr Val Asn His Val Ala Asp Asn 
435 440 445 

age tgg age ate tgg aag ttg cag aaa gee ate tct gtc ttg gag atg 1392 
Ser Trp Ser He Trp Lys Leu Gin Lys Ala He Ser Val Leu Glu Met 
450 455 460 

gag aat ggt tac tgg tgg tgc egg agg aag aaa cat cgt gaa ggg agg 1440 
Glu Asn Gly Tyr Trp Trp Cys Arg Arg Lys Lys His Arg Glu Gly Arg 
465 470 475 480 

ctg ctg aaa gtc ggc acc agg ggg gat ggt acc cct gat gag cgc tgg 14 88 
Leu Leu Lys Val Gly Thr Arg Gly Asp Gly Thr Pro Asp Glu Arg Trp 
485 490 495 

tgc ttc agg gtg gag gaa gta aat tgg get get tgg gag aag act ctt 1536 
Cys Phe Arg Val Glu Glu Val Asn Trp Ala Ala Trp Glu Lys Thr Leu 
500 505 . 510 

ccc acc tta tct gag gat cca tea ggg cca ggc ate act ggt aat aaa 1584 
Pro Thr Leu Ser Glu Asp Pro Ser Gly Pro Gly He Thr Gly Asn Lys 
515 520 525 

aag aac cca acc tct aaa ccg ggg aag aac agt gee tea gag gaa gac 1632 
Lys Asn Pro Thr Ser Lys Pro Gly Lys Asn Ser Ala Ser Glu Glu Asp 
530 535 540 

cat ctg ccc ctt cag gtc etc cag tec ccc 1662 
His Leu Pro Leu Gin Val Leu Gin Ser Pro 
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<210> 13 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
peptide 

<400> 13 

Ala Phe His Cys Lys Ser Pro His Arg His Arg Met Val Val Leu Glu 
15 10 15 



<210> 14 
<211> 25 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
peptide 

<400> 14 

Arg Pro Glu Ala Pro Thr Gly Pro Asn Ala Thr Glu Ser Val Gin Pro 
15 10 15 

Met Glu Gly Gin Glu Asp Glu Gly Asn 
20 25 

<210> 15 
<211> 19 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
peptide 

<400> 15 

Ser Val Leu Glu Met Glu Asn Gly Tyr Trp Trp Cys Arg Lys Lys Gin 
1 5 ,10 15 

Arg Ala Gly 



<210> 16 
<211> 20 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 16 

taggagaccc cgttgccacg 



<210> 17 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 17 

gattcacttg gggacagtga eg 22 



<210> 18 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 18 

ttaagctccc gttctccacc g 21 



<210> 19 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 19 

getgegggag gaagtgaagc 



<210> 20 
<211> 630 
<212> PRT 

<213> Homo sapiens 
<400> 20 

Met Thr Ser Pro Ser Ser Ser Pro Val Phe Arg Leu Glu Thr Leu Asp 
15 10 15 

Gly Gly Gin Glu Asp Gly Ser Glu Ala Asp Arg Gly Lys Leu Asp Phe 
20 25 30 
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Gly Ser Gly L u Pro Pro Met Glu Ser Gin Phe Gin Gly Glu Asp Arg 
35 40 45 

Lys Phe Ala Pro Gin He Arg Val Asn Leu Asn Tyr Arg Lys Gly Thr 
50 55 60 

Gly Ala Ser Gin Pro Asp Pro Asn Arg Phe Asp Arg Asp Arg Leu Phe 
65 70 75 80 

Asn Ala Val Ser Arg Gly Val Pro Glu Asp Leu Ala Gly Leu Pro Glu 
85 90 95 



Tyr Leu Ser Lys Thr Ser Lys Tyr Leu Thr Asp Ser Glu Tyr Thr Glu 
100 105 no 

Gly Ser Thr Gly Lys Thr Cys Leu Met Lys Ala Val Leu Asn Leu Lys 
115 120 125 

Asp Gly Val Asn Ala Cys lie Leu Pro Leu Leu Gin He Asp Arg Asp 
130 135 140 

Ser Gly Asn Pro Gin Pro Leu Val Asn Ala Gin Cys Thr Asp Asp Tyr 
145 "° 155 160 

Tyr Arg Gly His Ser Ala Leu His He Ala He Glu Lys Arg Ser Leu ■ 
165 170 175 

Gin Cys Val Lys Leu Leu Val Glu Asn Gly Ala Asn Val His Ala Arg 
180 185 190 

Ala Cys Gly Arg Phe Phe Gin Lys Gly Gin Gly Thr Cys Phe Tyr Phe 
1^5 200 205 

Gly Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp Asp Val 
210 215 220 

Val Ser Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu Gin Ala 
225 230 235 2 40 

Thr Asp Ser Gin Gly Asn Thr Val Leu His Ala Leu Val Met He Ser 
245 250 255 

Asp Asn Ser Ala Glu Asn He Ala Leu Val Thr Ser Met Tyr Asp Gly 
260 265 270 

Leu Leu Gin Ala Gly Ala Arg Leu Cys Pro Thr Val Gin Leu Glu Asp 
275 280 285 

lie Arg Asn Leu Gin Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu 
290 295 300 

Gly Lys lie Glu lie Phe Arg His lie Leu Gin Arg Glu Phe Ser Glv 
305 310 315 320 

Leu Ser His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val 
325 330 335 
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Arg Val Ser Leu Tyr Asp L u Ala Ser Val Asp Ser Cys Glu Glu Asn 
340 345 350 

Ser Val Leu Glu lie II Ala Phe His Cys Lys Ser Pro His Arg His 
355 360 365 

Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin Ala Lys Trp 
370 375 380 

Asp Leu Leu lie Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu lie 
385 390 395 400 

Tyr Met Phe He Phe Thr Ala Val Ala Tyr His Gin Pro Thr Leu Lys 
405 410 415 



Lys Gin Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu 
420 425 430 

Leu Thr Gly His He Leu He Leu Leu Gly Gly He Tyr Leu Leu Val 
435 440 445 

Gly Gin Leu Trp Tyr Phe Trp Arg Arg His Val Phe He Trp He Ser 
450 455 460 

Phe He Asp Ser Tyr Phe Glu He Leu Phe Leu Phe Gin Ala Leu Leu 
465 470 475 480 

Thr Val Val Ser Gin Val Leu Cys Phe Leu Ala He Glu Trp Tyr Leu 
485 490 495 

Pro Leu Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr 
500 505 510 

Tyr Thr Arg Gly Phe Gin His Thr Gly He Tyr Ser Val Met He Gin 
515 520 525 

Lys Lys Ala He Ser Val Leu Glu Met Glu Asn Gly Tyr Trp Trp Cys 
530 535 540 

Arg Lys Lys Gin Arg Ala Gly Val Met Leu Thr Val Gly Thr Lys Pro 
545 550 555 560 

Asp Gly Ser Pro Asp Glu Arg Trp Cys Phe Arg Val Glu Glu Val Asn 
565 570 575 

Trp Ala Ser Trp Glu Gin Thr Leu Pro Thr Leu Cys Glu Asp Pro Ser 
580 585 590 

Gly Ala Gly Val Pro Arg Thr Leu Glu Asn Pro Val Leu Ala Ser Pro 
595 600 605 

Pro Lys Glu Asp Glu Asp Gly Ala Ser Glu Glu Asn Tyr Val Pro Val 
610 615 620 



Gin Leu Leu Gin Ser Asn 
625 630 
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